Dataset statistics
| Number of variables | 38 |
|---|---|
| Number of observations | 39322 |
| Missing cells | 227433 |
| Missing cells (%) | 15.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 41.9 MiB |
| Average record size in memory | 1.1 KiB |
Variable types
| Categorical | 19 |
|---|---|
| DateTime | 2 |
| Numeric | 17 |
brand has a high cardinality: 329 distinct values | High cardinality |
model_code has a high cardinality: 38715 distinct values | High cardinality |
model_label has a high cardinality: 29558 distinct values | High cardinality |
commercial_label has a high cardinality: 4772 distinct values | High cardinality |
incorrect_fedas_code has a high cardinality: 2188 distinct values | High cardinality |
article_main_category has a high cardinality: 710 distinct values | High cardinality |
article_type has a high cardinality: 1221 distinct values | High cardinality |
article_detail has a high cardinality: 4076 distinct values | High cardinality |
comment has a high cardinality: 134 distinct values | High cardinality |
color_code has a high cardinality: 4399 distinct values | High cardinality |
color_label has a high cardinality: 12907 distinct values | High cardinality |
country_of_origin has a high cardinality: 74 distinct values | High cardinality |
country_of_manufacture has a high cardinality: 78 distinct values | High cardinality |
embakment_harbor has a high cardinality: 52 distinct values | High cardinality |
size has a high cardinality: 812 distinct values | High cardinality |
length is highly correlated with width and 1 other fields | High correlation |
width is highly correlated with length and 1 other fields | High correlation |
height is highly correlated with length and 1 other fields | High correlation |
minimum_multiple_of_order is highly correlated with net_weight | High correlation |
net_weight is highly correlated with minimum_multiple_of_order and 1 other fields | High correlation |
raw_weight is highly correlated with net_weight | High correlation |
correct_fedas_1 is highly correlated with incorrect_fedas_1 | High correlation |
incorrect_fedas_1 is highly correlated with correct_fedas_1 and 3 other fields | High correlation |
incorrect_fedas_2 is highly correlated with incorrect_fedas_1 and 2 other fields | High correlation |
incorrect_fedas_3 is highly correlated with incorrect_fedas_1 and 2 other fields | High correlation |
incorrect_fedas_4 is highly correlated with incorrect_fedas_1 and 2 other fields | High correlation |
length is highly correlated with width and 1 other fields | High correlation |
width is highly correlated with length | High correlation |
height is highly correlated with length | High correlation |
net_weight is highly correlated with raw_weight | High correlation |
raw_weight is highly correlated with net_weight | High correlation |
incorrect_fedas_1 is highly correlated with incorrect_fedas_2 and 1 other fields | High correlation |
incorrect_fedas_2 is highly correlated with incorrect_fedas_1 | High correlation |
incorrect_fedas_4 is highly correlated with incorrect_fedas_1 | High correlation |
length is highly correlated with width and 1 other fields | High correlation |
width is highly correlated with length and 1 other fields | High correlation |
height is highly correlated with length and 1 other fields | High correlation |
net_weight is highly correlated with raw_weight | High correlation |
raw_weight is highly correlated with net_weight | High correlation |
correct_fedas_1 is highly correlated with incorrect_fedas_1 | High correlation |
incorrect_fedas_1 is highly correlated with correct_fedas_1 and 1 other fields | High correlation |
incorrect_fedas_2 is highly correlated with incorrect_fedas_1 | High correlation |
embakment_harbor is highly correlated with country_of_manufacture and 3 other fields | High correlation |
inaccurate_gender is highly correlated with accurate_gender | High correlation |
country_of_manufacture is highly correlated with embakment_harbor and 1 other fields | High correlation |
eco_furniture is highly correlated with embakment_harbor | High correlation |
country_of_origin is highly correlated with embakment_harbor and 1 other fields | High correlation |
accurate_gender is highly correlated with inaccurate_gender and 1 other fields | High correlation |
correct_fedas_1 is highly correlated with embakment_harbor and 1 other fields | High correlation |
length is highly correlated with width and 3 other fields | High correlation |
width is highly correlated with length and 4 other fields | High correlation |
height is highly correlated with length and 3 other fields | High correlation |
inaccurate_gender is highly correlated with country_of_origin and 6 other fields | High correlation |
country_of_origin is highly correlated with width and 13 other fields | High correlation |
country_of_manufacture is highly correlated with width and 12 other fields | High correlation |
embakment_harbor is highly correlated with country_of_origin and 11 other fields | High correlation |
shipping_date is highly correlated with length and 6 other fields | High correlation |
eco_participation is highly correlated with eco_furniture | High correlation |
eco_furniture is highly correlated with eco_participation | High correlation |
net_weight is highly correlated with raw_weight | High correlation |
raw_weight is highly correlated with net_weight | High correlation |
volume is highly correlated with length and 4 other fields | High correlation |
accurate_gender is highly correlated with inaccurate_gender and 7 other fields | High correlation |
correct_fedas_1 is highly correlated with inaccurate_gender and 6 other fields | High correlation |
correct_fedas_2 is highly correlated with country_of_origin and 5 other fields | High correlation |
correct_fedas_3 is highly correlated with country_of_origin and 6 other fields | High correlation |
correct_fedas_4 is highly correlated with inaccurate_gender and 4 other fields | High correlation |
incorrect_fedas_1 is highly correlated with inaccurate_gender and 8 other fields | High correlation |
incorrect_fedas_2 is highly correlated with country_of_origin and 7 other fields | High correlation |
incorrect_fedas_3 is highly correlated with country_of_origin and 7 other fields | High correlation |
incorrect_fedas_4 is highly correlated with inaccurate_gender and 8 other fields | High correlation |
commercial_label has 33084 (84.1%) missing values | Missing |
article_main_category has 751 (1.9%) missing values | Missing |
article_type has 920 (2.3%) missing values | Missing |
article_detail has 9700 (24.7%) missing values | Missing |
comment has 37770 (96.1%) missing values | Missing |
avalability_start_date has 14414 (36.7%) missing values | Missing |
avalability_end_date has 17318 (44.0%) missing values | Missing |
color_code has 12645 (32.2%) missing values | Missing |
inaccurate_gender has 19630 (49.9%) missing values | Missing |
country_of_origin has 14402 (36.6%) missing values | Missing |
country_of_manufacture has 14402 (36.6%) missing values | Missing |
embakment_harbor has 36549 (92.9%) missing values | Missing |
shipping_date has 15834 (40.3%) missing values | Missing |
length is highly skewed (γ1 = 55.40474886) | Skewed |
width is highly skewed (γ1 = 88.89368975) | Skewed |
height is highly skewed (γ1 = 32.8271002) | Skewed |
eco_participation is highly skewed (γ1 = 22.62168) | Skewed |
minimum_multiple_of_order is highly skewed (γ1 = 23.04697745) | Skewed |
net_weight is highly skewed (γ1 = 77.07187952) | Skewed |
raw_weight is highly skewed (γ1 = 108.8920108) | Skewed |
volume is highly skewed (γ1 = 34.39569575) | Skewed |
model_code is uniformly distributed | Uniform |
commercial_label is uniformly distributed | Uniform |
length has 38334 (97.5%) zeros | Zeros |
width has 38235 (97.2%) zeros | Zeros |
height has 38279 (97.3%) zeros | Zeros |
eco_participation has 38117 (96.9%) zeros | Zeros |
multiple_of_order has 7440 (18.9%) zeros | Zeros |
minimum_multiple_of_order has 17829 (45.3%) zeros | Zeros |
net_weight has 26860 (68.3%) zeros | Zeros |
raw_weight has 30149 (76.7%) zeros | Zeros |
volume has 34170 (86.9%) zeros | Zeros |
correct_fedas_2 has 7602 (19.3%) zeros | Zeros |
correct_fedas_4 has 1309 (3.3%) zeros | Zeros |
incorrect_fedas_2 has 3649 (9.3%) zeros | Zeros |
incorrect_fedas_4 has 987 (2.5%) zeros | Zeros |
Reproduction
| Analysis started | 2022-11-11 11:55:55.555200 |
|---|---|
| Analysis finished | 2022-11-11 11:56:28.660920 |
| Duration | 33.11 seconds |
| Software version | pandas-profiling v3.2.0 |
| Download configuration | config.json |
| Distinct | 329 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.5 MiB |
| brand_1 | |
|---|---|
| brand_293 | 1915 |
| brand_383 | 1737 |
| brand_56 | 1064 |
| brand_243 | 1028 |
| Other values (324) |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.495371548 |
| Min length | 7 |
Characters and Unicode
| Total characters | 334055 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 27 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | brand_293 |
|---|---|
| 2nd row | brand_3 |
| 3rd row | brand_265 |
| 4th row | brand_1 |
| 5th row | brand_12 |
Common Values
| Value | Count | Frequency (%) |
| brand_1 | 6089 | 15.5% |
| brand_293 | 1915 | 4.9% |
| brand_383 | 1737 | 4.4% |
| brand_56 | 1064 | 2.7% |
| brand_243 | 1028 | 2.6% |
| brand_102 | 934 | 2.4% |
| brand_194 | 919 | 2.3% |
| brand_285 | 692 | 1.8% |
| brand_288 | 681 | 1.7% |
| brand_175 | 665 | 1.7% |
| Other values (319) | 23598 |
Length
| Value | Count | Frequency (%) |
| brand_1 | 6089 | 15.5% |
| brand_293 | 1915 | 4.9% |
| brand_383 | 1737 | 4.4% |
| brand_56 | 1064 | 2.7% |
| brand_243 | 1028 | 2.6% |
| brand_102 | 934 | 2.4% |
| brand_194 | 919 | 2.3% |
| brand_285 | 692 | 1.8% |
| brand_288 | 681 | 1.7% |
| brand_175 | 665 | 1.7% |
| Other values (319) | 23598 |
Most occurring characters
| Value | Count | Frequency (%) |
| b | 39322 | |
| r | 39322 | |
| a | 39322 | |
| n | 39322 | |
| d | 39322 | |
| _ | 39322 | |
| 1 | 19580 | |
| 3 | 18242 | |
| 2 | 13777 | 4.1% |
| 9 | 7782 | 2.3% |
| Other values (6) | 38742 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 196610 | |
| Decimal Number | 98123 | |
| Connector Punctuation | 39322 | 11.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 19580 | |
| 3 | 18242 | |
| 2 | 13777 | |
| 9 | 7782 | 7.9% |
| 8 | 7686 | 7.8% |
| 4 | 7685 | 7.8% |
| 5 | 7527 | 7.7% |
| 7 | 6126 | 6.2% |
| 0 | 4871 | 5.0% |
| 6 | 4847 | 4.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| b | 39322 | |
| r | 39322 | |
| a | 39322 | |
| n | 39322 | |
| d | 39322 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 39322 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 196610 | |
| Common | 137445 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| _ | 39322 | |
| 1 | 19580 | |
| 3 | 18242 | |
| 2 | 13777 | 10.0% |
| 9 | 7782 | 5.7% |
| 8 | 7686 | 5.6% |
| 4 | 7685 | 5.6% |
| 5 | 7527 | 5.5% |
| 7 | 6126 | 4.5% |
| 0 | 4871 | 3.5% |
Latin
| Value | Count | Frequency (%) |
| b | 39322 | |
| r | 39322 | |
| a | 39322 | |
| n | 39322 | |
| d | 39322 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 334055 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| b | 39322 | |
| r | 39322 | |
| a | 39322 | |
| n | 39322 | |
| d | 39322 | |
| _ | 39322 | |
| 1 | 19580 | |
| 3 | 18242 | |
| 2 | 13777 | 4.1% |
| 9 | 7782 | 2.3% |
| Other values (6) | 38742 |
| Distinct | 38715 |
|---|---|
| Distinct (%) | 98.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.4 MiB |
| 813271-40 | 4 |
|---|---|
| 1865971 | 3 |
| 0494394 | 3 |
| 813800-40 | 3 |
| 214304 | 3 |
| Other values (38710) |
Length
| Max length | 21 |
|---|---|
| Median length | 18 |
| Mean length | 7.689893698 |
| Min length | 2 |
Characters and Unicode
| Total characters | 302382 |
|---|---|
| Distinct characters | 43 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 38133 ? |
|---|---|
| Unique (%) | 97.0% |
Sample
| 1st row | S42783 |
|---|---|
| 2nd row | R1252 |
| 3rd row | OXS917808 |
| 4th row | GM5253 |
| 5th row | MS338 |
Common Values
| Value | Count | Frequency (%) |
| 813271-40 | 4 | < 0.1% |
| 1865971 | 3 | < 0.1% |
| 0494394 | 3 | < 0.1% |
| 813800-40 | 3 | < 0.1% |
| 214304 | 3 | < 0.1% |
| 813750-40 | 3 | < 0.1% |
| 813721-40 | 3 | < 0.1% |
| 813320-40 | 3 | < 0.1% |
| 1021 | 3 | < 0.1% |
| 813180-40 | 3 | < 0.1% |
| Other values (38705) | 39291 |
Length
| Value | Count | Frequency (%) |
| int | 13 | < 0.1% |
| j | 8 | < 0.1% |
| 1 | 5 | < 0.1% |
| 813271-40 | 4 | < 0.1% |
| e | 4 | < 0.1% |
| serraline | 3 | < 0.1% |
| pro | 3 | < 0.1% |
| kt | 3 | < 0.1% |
| 813380-40 | 3 | < 0.1% |
| ridge | 3 | < 0.1% |
| Other values (38722) | 39332 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 45054 | |
| 1 | 32947 | |
| 2 | 23988 | 7.9% |
| 3 | 20221 | 6.7% |
| 5 | 18073 | 6.0% |
| 6 | 17491 | 5.8% |
| 4 | 16861 | 5.6% |
| 8 | 16676 | 5.5% |
| 7 | 15812 | 5.2% |
| 9 | 14392 | 4.8% |
| Other values (33) | 80867 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 221515 | |
| Uppercase Letter | 75116 | 24.8% |
| Dash Punctuation | 3752 | 1.2% |
| Other Punctuation | 1915 | 0.6% |
| Space Separator | 62 | < 0.1% |
| Connector Punctuation | 20 | < 0.1% |
| Lowercase Letter | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 6092 | 8.1% |
| A | 5128 | 6.8% |
| G | 4833 | 6.4% |
| W | 4535 | 6.0% |
| E | 4053 | 5.4% |
| D | 3987 | 5.3% |
| B | 3667 | 4.9% |
| M | 3626 | 4.8% |
| S | 3424 | 4.6% |
| I | 3159 | 4.2% |
| Other values (16) | 32612 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 45054 | |
| 1 | 32947 | |
| 2 | 23988 | |
| 3 | 20221 | |
| 5 | 18073 | |
| 6 | 17491 | 7.9% |
| 4 | 16861 | 7.6% |
| 8 | 16676 | 7.5% |
| 7 | 15812 | 7.1% |
| 9 | 14392 | 6.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1410 | |
| / | 505 | 26.4% |
Lowercase Letter
| Value | Count | Frequency (%) |
| p | 1 | |
| l | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3752 |
Space Separator
| Value | Count | Frequency (%) |
| 62 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 20 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 227264 | |
| Latin | 75118 | 24.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| F | 6092 | 8.1% |
| A | 5128 | 6.8% |
| G | 4833 | 6.4% |
| W | 4535 | 6.0% |
| E | 4053 | 5.4% |
| D | 3987 | 5.3% |
| B | 3667 | 4.9% |
| M | 3626 | 4.8% |
| S | 3424 | 4.6% |
| I | 3159 | 4.2% |
| Other values (18) | 32614 |
Common
| Value | Count | Frequency (%) |
| 0 | 45054 | |
| 1 | 32947 | |
| 2 | 23988 | |
| 3 | 20221 | |
| 5 | 18073 | |
| 6 | 17491 | 7.7% |
| 4 | 16861 | 7.4% |
| 8 | 16676 | 7.3% |
| 7 | 15812 | 7.0% |
| 9 | 14392 | 6.3% |
| Other values (5) | 5749 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 302382 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 45054 | |
| 1 | 32947 | |
| 2 | 23988 | 7.9% |
| 3 | 20221 | 6.7% |
| 5 | 18073 | 6.0% |
| 6 | 17491 | 5.8% |
| 4 | 16861 | 5.6% |
| 8 | 16676 | 5.5% |
| 7 | 15812 | 5.2% |
| 9 | 14392 | 4.8% |
| Other values (33) | 80867 |
| Distinct | 29558 |
|---|---|
| Distinct (%) | 75.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.9 MiB |
| MAN JEANS | 86 |
|---|---|
| VESTE | 77 |
| WOMAN JEANS | 64 |
| CREWNECK T-SHIRT | 61 |
| CHUCK TAYLOR ALL STAR | 47 |
| Other values (29553) |
Length
| Max length | 24518 |
|---|---|
| Median length | 26 |
| Mean length | 18.81748131 |
| Min length | 1 |
Characters and Unicode
| Total characters | 739941 |
|---|---|
| Distinct characters | 104 |
| Distinct categories | 14 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 25415 ? |
|---|---|
| Unique (%) | 64.6% |
Sample
| 1st row | FLEXAGON ENERGY TR 3.0 MT |
|---|---|
| 2nd row | TADEN PLUS FUR |
| 3rd row | POCHETTE PORTE TRAVERS PE |
| 4th row | CLUB KNOT TANK |
| 5th row | BONITA DK PNK/BLCK M |
Common Values
| Value | Count | Frequency (%) |
| MAN JEANS | 86 | 0.2% |
| VESTE | 77 | 0.2% |
| WOMAN JEANS | 64 | 0.2% |
| CREWNECK T-SHIRT | 61 | 0.2% |
| CHUCK TAYLOR ALL STAR | 47 | 0.1% |
| CL LTHR | 40 | 0.1% |
| TEE | 39 | 0.1% |
| BIKINI | 35 | 0.1% |
| REEBOK ROYAL GLIDE | 28 | 0.1% |
| REEBOK ROYAL GLIDE RPLCLP | 28 | 0.1% |
| Other values (29548) | 38817 |
Length
| Value | Count | Frequency (%) |
| w | 2278 | 1.7% |
| m | 2016 | 1.5% |
| tee | 1594 | 1.2% |
| jacket | 892 | 0.7% |
| j | 789 | 0.6% |
| ss | 763 | 0.6% |
| top | 750 | 0.5% |
| jr | 728 | 0.5% |
| 2.0 | 692 | 0.5% |
| short | 680 | 0.5% |
| Other values (18443) | 125990 |
Most occurring characters
| Value | Count | Frequency (%) |
| 98108 | 13.3% | |
| E | 61815 | 8.4% |
| A | 46466 | 6.3% |
| T | 44884 | 6.1% |
| R | 41573 | 5.6% |
| S | 40283 | 5.4% |
| O | 40012 | 5.4% |
| I | 36243 | 4.9% |
| L | 34825 | 4.7% |
| N | 32549 | 4.4% |
| Other values (94) | 263183 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 591698 | |
| Space Separator | 98135 | 13.3% |
| Decimal Number | 33941 | 4.6% |
| Other Punctuation | 10931 | 1.5% |
| Dash Punctuation | 3051 | 0.4% |
| Lowercase Letter | 534 | 0.1% |
| Math Symbol | 461 | 0.1% |
| Open Punctuation | 322 | < 0.1% |
| Control | 309 | < 0.1% |
| Close Punctuation | 269 | < 0.1% |
| Other values (4) | 290 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 61815 | 10.4% |
| A | 46466 | 7.9% |
| T | 44884 | 7.6% |
| R | 41573 | 7.0% |
| S | 40283 | 6.8% |
| O | 40012 | 6.8% |
| I | 36243 | 6.1% |
| L | 34825 | 5.9% |
| N | 32549 | 5.5% |
| C | 26701 | 4.5% |
| Other values (27) | 186347 |
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 143 | |
| h | 140 | |
| l | 139 | |
| s | 24 | 4.5% |
| o | 19 | 3.6% |
| e | 10 | 1.9% |
| r | 8 | 1.5% |
| c | 7 | 1.3% |
| a | 6 | 1.1% |
| t | 5 | 0.9% |
| Other values (14) | 33 | 6.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 5510 | |
| . | 2177 | 19.9% |
| / | 1978 | 18.1% |
| ' | 763 | 7.0% |
| , | 167 | 1.5% |
| & | 138 | 1.3% |
| \ | 85 | 0.8% |
| " | 51 | 0.5% |
| ? | 24 | 0.2% |
| # | 13 | 0.1% |
| Other values (3) | 25 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8448 | |
| 2 | 5780 | |
| 1 | 5394 | |
| 3 | 4003 | |
| 5 | 2326 | 6.9% |
| 4 | 1802 | 5.3% |
| 9 | 1728 | 5.1% |
| 6 | 1668 | 4.9% |
| 7 | 1495 | 4.4% |
| 8 | 1297 | 3.8% |
Control
| Value | Count | Frequency (%) |
| 145 | ||
| 145 | ||
| | 11 | 3.6% |
| | 5 | 1.6% |
| | 3 | 1.0% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 449 | |
| > | 11 | 2.4% |
| | | 1 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 98108 | ||
| 27 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 295 | |
| [ | 27 | 8.4% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 242 | |
| ] | 27 | 10.0% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 206 | |
| ® | 20 | 8.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3051 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 56 |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 6 |
Other Number
| Value | Count | Frequency (%) |
| ² | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 592232 | |
| Common | 147709 | 20.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 61815 | 10.4% |
| A | 46466 | 7.8% |
| T | 44884 | 7.6% |
| R | 41573 | 7.0% |
| S | 40283 | 6.8% |
| O | 40012 | 6.8% |
| I | 36243 | 6.1% |
| L | 34825 | 5.9% |
| N | 32549 | 5.5% |
| C | 26701 | 4.5% |
| Other values (51) | 186881 |
Common
| Value | Count | Frequency (%) |
| 98108 | ||
| 0 | 8448 | 5.7% |
| 2 | 5780 | 3.9% |
| ; | 5510 | 3.7% |
| 1 | 5394 | 3.7% |
| 3 | 4003 | 2.7% |
| - | 3051 | 2.1% |
| 5 | 2326 | 1.6% |
| . | 2177 | 1.5% |
| / | 1978 | 1.3% |
| Other values (33) | 10934 | 7.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 739598 | |
| None | 343 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 98108 | 13.3% | |
| E | 61815 | 8.4% |
| A | 46466 | 6.3% |
| T | 44884 | 6.1% |
| R | 41573 | 5.6% |
| S | 40283 | 5.4% |
| O | 40012 | 5.4% |
| I | 36243 | 4.9% |
| L | 34825 | 4.7% |
| N | 32549 | 4.4% |
| Other values (75) | 262840 |
None
| Value | Count | Frequency (%) |
| ° | 206 | |
| É | 29 | 8.5% |
| 27 | 7.9% | |
| ® | 20 | 5.8% |
| Ã | 13 | 3.8% |
| | 11 | 3.2% |
| « | 6 | 1.7% |
| À | 5 | 1.5% |
| | 5 | 1.5% |
| Â | 4 | 1.2% |
| Other values (9) | 17 | 5.0% |
| Distinct | 4772 |
|---|---|
| Distinct (%) | 76.5% |
| Missing | 33084 |
| Missing (%) | 84.1% |
| Memory size | 1.4 MiB |
| TBT_AP_MN TOP | 20 |
|---|---|
| GM500 D | 15 |
| SLENDER | 13 |
| PC574 M | 13 |
| PALM BEACH | 13 |
| Other values (4767) |
Length
| Max length | 25 |
|---|---|
| Median length | 22 |
| Mean length | 16.5801539 |
| Min length | 1 |
Characters and Unicode
| Total characters | 103427 |
|---|---|
| Distinct characters | 61 |
| Distinct categories | 12 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 3926 ? |
|---|---|
| Unique (%) | 62.9% |
Sample
| 1st row | OMEGAS 160 CAPSULES |
|---|---|
| 2nd row | JORDAN LEGEND ANKLE 6PK |
| 3rd row | SHORT DE MUAY THAI VENUM |
| 4th row | FOOT T5 TRAINER FC BARCEL |
| 5th row | PAGAIE KAYAK SYMETRIQUE, |
Common Values
| Value | Count | Frequency (%) |
| TBT_AP_MN TOP | 20 | 0.1% |
| GM500 D | 15 | < 0.1% |
| SLENDER | 13 | < 0.1% |
| PC574 M | 13 | < 0.1% |
| PALM BEACH | 13 | < 0.1% |
| CLASH | 13 | < 0.1% |
| YV574 M | 12 | < 0.1% |
| POLO MANCHES COURTES | 12 | < 0.1% |
| TEE | 12 | < 0.1% |
| GC574 M | 11 | < 0.1% |
| Other values (4762) | 6104 | 15.5% |
| (Missing) | 33084 |
Length
| Value | Count | Frequency (%) |
| icepeak | 525 | 2.9% |
| de | 333 | 1.8% |
| 284 | 1.6% | |
| m | 218 | 1.2% |
| luhta | 199 | 1.1% |
| jr | 192 | 1.0% |
| homme | 160 | 0.9% |
| d | 145 | 0.8% |
| tee | 135 | 0.7% |
| a | 131 | 0.7% |
| Other values (5137) | 15967 |
Most occurring characters
| Value | Count | Frequency (%) |
| 12301 | 11.9% | |
| E | 10053 | 9.7% |
| A | 7295 | 7.1% |
| T | 6091 | 5.9% |
| S | 5457 | 5.3% |
| O | 5405 | 5.2% |
| R | 5110 | 4.9% |
| I | 5099 | 4.9% |
| L | 4706 | 4.6% |
| N | 4432 | 4.3% |
| Other values (51) | 37478 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 82662 | |
| Space Separator | 12315 | 11.9% |
| Decimal Number | 7062 | 6.8% |
| Dash Punctuation | 617 | 0.6% |
| Other Punctuation | 562 | 0.5% |
| Math Symbol | 80 | 0.1% |
| Connector Punctuation | 60 | 0.1% |
| Open Punctuation | 28 | < 0.1% |
| Lowercase Letter | 13 | < 0.1% |
| Other Symbol | 13 | < 0.1% |
| Other values (2) | 15 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 10053 | |
| A | 7295 | 8.8% |
| T | 6091 | 7.4% |
| S | 5457 | 6.6% |
| O | 5405 | 6.5% |
| R | 5110 | 6.2% |
| I | 5099 | 6.2% |
| L | 4706 | 5.7% |
| N | 4432 | 5.4% |
| C | 4063 | 4.9% |
| Other values (19) | 24951 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1715 | |
| 3 | 1096 | |
| 1 | 1075 | |
| 2 | 845 | |
| 5 | 628 | 8.9% |
| 9 | 379 | 5.4% |
| 4 | 378 | 5.4% |
| 7 | 362 | 5.1% |
| 6 | 299 | 4.2% |
| 8 | 285 | 4.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 252 | |
| . | 131 | |
| , | 93 | 16.5% |
| ' | 65 | 11.6% |
| & | 14 | 2.5% |
| " | 5 | 0.9% |
| % | 2 | 0.4% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 10 | |
| u | 1 | 7.7% |
| r | 1 | 7.7% |
| f | 1 | 7.7% |
Space Separator
| Value | Count | Frequency (%) |
| 12301 | ||
| 14 | 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 63 | |
| > | 17 | 21.2% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 8 | |
| ® | 5 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 617 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 60 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 28 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 10 |
Control
| Value | Count | Frequency (%) |
| | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 82675 | |
| Common | 20752 | 20.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 10053 | |
| A | 7295 | 8.8% |
| T | 6091 | 7.4% |
| S | 5457 | 6.6% |
| O | 5405 | 6.5% |
| R | 5110 | 6.2% |
| I | 5099 | 6.2% |
| L | 4706 | 5.7% |
| N | 4432 | 5.4% |
| C | 4063 | 4.9% |
| Other values (23) | 24964 |
Common
| Value | Count | Frequency (%) |
| 12301 | ||
| 0 | 1715 | 8.3% |
| 3 | 1096 | 5.3% |
| 1 | 1075 | 5.2% |
| 2 | 845 | 4.1% |
| 5 | 628 | 3.0% |
| - | 617 | 3.0% |
| 9 | 379 | 1.8% |
| 4 | 378 | 1.8% |
| 7 | 362 | 1.7% |
| Other values (18) | 1356 | 6.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 103391 | |
| None | 36 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 12301 | 11.9% | |
| E | 10053 | 9.7% |
| A | 7295 | 7.1% |
| T | 6091 | 5.9% |
| S | 5457 | 5.3% |
| O | 5405 | 5.2% |
| R | 5110 | 4.9% |
| I | 5099 | 4.9% |
| L | 4706 | 4.6% |
| N | 4432 | 4.3% |
| Other values (44) | 37442 |
None
| Value | Count | Frequency (%) |
| 14 | ||
| ° | 8 | |
| ® | 5 | 13.9% |
| | 5 | 13.9% |
| É | 2 | 5.6% |
| À | 1 | 2.8% |
| È | 1 | 2.8% |
| Distinct | 2188 |
|---|---|
| Distinct (%) | 5.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| 275124 | 467 |
|---|---|
| 375311 | 418 |
| 375313 | 393 |
| 278125 | 307 |
| Other values (2183) |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 4.343827883 |
| Min length | 0 |
Characters and Unicode
| Total characters | 170808 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 550 ? |
|---|---|
| Unique (%) | 1.4% |
Sample
| 1st row | 378011 |
|---|---|
| 2nd row | |
| 3rd row | 175897 |
| 4th row | 224122 |
| 5th row |
Common Values
| Value | Count | Frequency (%) |
| 10854 | ||
| 275124 | 467 | 1.2% |
| 375311 | 418 | 1.1% |
| 375313 | 393 | 1.0% |
| 278125 | 307 | 0.8% |
| 232377 | 288 | 0.7% |
| 375312 | 282 | 0.7% |
| 275121 | 258 | 0.7% |
| 275125 | 256 | 0.7% |
| 232904 | 253 | 0.6% |
| Other values (2178) | 25546 |
Length
| Value | Count | Frequency (%) |
| 275124 | 467 | 1.6% |
| 375311 | 418 | 1.5% |
| 375313 | 393 | 1.4% |
| 278125 | 307 | 1.1% |
| 232377 | 288 | 1.0% |
| 375312 | 282 | 1.0% |
| 275121 | 258 | 0.9% |
| 275125 | 256 | 0.9% |
| 232904 | 253 | 0.9% |
| 375963 | 243 | 0.9% |
| Other values (2177) | 25303 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 31425 | |
| 1 | 23437 | |
| 3 | 21305 | |
| 7 | 20212 | |
| 0 | 17128 | |
| 5 | 16053 | |
| 4 | 12109 | 7.1% |
| 9 | 10156 | 5.9% |
| 8 | 9848 | 5.8% |
| 6 | 9135 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 170808 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 31425 | |
| 1 | 23437 | |
| 3 | 21305 | |
| 7 | 20212 | |
| 0 | 17128 | |
| 5 | 16053 | |
| 4 | 12109 | 7.1% |
| 9 | 10156 | 5.9% |
| 8 | 9848 | 5.8% |
| 6 | 9135 | 5.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 170808 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 31425 | |
| 1 | 23437 | |
| 3 | 21305 | |
| 7 | 20212 | |
| 0 | 17128 | |
| 5 | 16053 | |
| 4 | 12109 | 7.1% |
| 9 | 10156 | 5.9% |
| 8 | 9848 | 5.8% |
| 6 | 9135 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 170808 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 31425 | |
| 1 | 23437 | |
| 3 | 21305 | |
| 7 | 20212 | |
| 0 | 17128 | |
| 5 | 16053 | |
| 4 | 12109 | 7.1% |
| 9 | 10156 | 5.9% |
| 8 | 9848 | 5.8% |
| 6 | 9135 | 5.3% |
| Distinct | 710 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 751 |
| Missing (%) | 1.9% |
| Memory size | 2.5 MiB |
| LOISIRS | |
|---|---|
| FOOTBALL | 2518 |
| TRAINING | 2498 |
| SPORTSTYLE | 2310 |
| LOISIR | 1770 |
| Other values (705) |
Length
| Max length | 35 |
|---|---|
| Median length | 33 |
| Mean length | 10.01521869 |
| Min length | 2 |
Characters and Unicode
| Total characters | 386297 |
|---|---|
| Distinct characters | 50 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 145 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | TRAINING |
|---|---|
| 2nd row | GARDEN |
| 3rd row | SAC |
| 4th row | RACKET SPORTS |
| 5th row | TARTAN CHECKS |
Common Values
| Value | Count | Frequency (%) |
| LOISIRS | 2855 | 7.3% |
| FOOTBALL | 2518 | 6.4% |
| TRAINING | 2498 | 6.4% |
| SPORTSTYLE | 2310 | 5.9% |
| LOISIR | 1770 | 4.5% |
| RUNNING | 1263 | 3.2% |
| APPAREL | 1063 | 2.7% |
| OUTDOOR | 945 | 2.4% |
| MULTISPORT | 845 | 2.1% |
| COLLECTIVITES | 748 | 1.9% |
| Other values (700) | 21756 | |
| (Missing) | 751 | 1.9% |
Length
| Value | Count | Frequency (%) |
| loisirs | 3132 | 6.1% |
| football | 2563 | 5.0% |
| training | 2549 | 4.9% |
| sports | 2384 | 4.6% |
| sportstyle | 2310 | 4.5% |
| loisir | 1906 | 3.7% |
| textile | 1613 | 3.1% |
| running | 1441 | 2.8% |
| apparel | 1428 | 2.8% |
| outdoor | 1388 | 2.7% |
| Other values (638) | 30792 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 39455 | |
| I | 35023 | 9.1% |
| E | 34986 | 9.1% |
| O | 34666 | 9.0% |
| T | 34137 | 8.8% |
| R | 32467 | 8.4% |
| L | 27171 | 7.0% |
| A | 23171 | 6.0% |
| N | 20988 | 5.4% |
| 13282 | 3.4% | |
| Other values (40) | 90951 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 370373 | |
| Space Separator | 13282 | 3.4% |
| Other Punctuation | 2102 | 0.5% |
| Dash Punctuation | 371 | 0.1% |
| Decimal Number | 124 | < 0.1% |
| Math Symbol | 26 | < 0.1% |
| Lowercase Letter | 7 | < 0.1% |
| Open Punctuation | 6 | < 0.1% |
| Close Punctuation | 6 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 39455 | |
| I | 35023 | |
| E | 34986 | |
| O | 34666 | |
| T | 34137 | |
| R | 32467 | |
| L | 27171 | 7.3% |
| A | 23171 | 6.3% |
| N | 20988 | 5.7% |
| P | 12526 | 3.4% |
| Other values (17) | 75783 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 68 | |
| 2 | 27 | 21.8% |
| 3 | 18 | 14.5% |
| 8 | 3 | 2.4% |
| 6 | 3 | 2.4% |
| 5 | 3 | 2.4% |
| 4 | 2 | 1.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1562 | |
| & | 361 | 17.2% |
| . | 66 | 3.1% |
| , | 42 | 2.0% |
| ? | 36 | 1.7% |
| ' | 35 | 1.7% |
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 2 | |
| l | 2 | |
| r | 1 | |
| e | 1 | |
| s | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 13282 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 371 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 26 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 6 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 370380 | |
| Common | 15917 | 4.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 39455 | |
| I | 35023 | |
| E | 34986 | |
| O | 34666 | |
| T | 34137 | |
| R | 32467 | |
| L | 27171 | 7.3% |
| A | 23171 | 6.3% |
| N | 20988 | 5.7% |
| P | 12526 | 3.4% |
| Other values (22) | 75790 |
Common
| Value | Count | Frequency (%) |
| 13282 | ||
| / | 1562 | 9.8% |
| - | 371 | 2.3% |
| & | 361 | 2.3% |
| 1 | 68 | 0.4% |
| . | 66 | 0.4% |
| , | 42 | 0.3% |
| ? | 36 | 0.2% |
| ' | 35 | 0.2% |
| 2 | 27 | 0.2% |
| Other values (8) | 67 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 386261 | |
| None | 36 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 39455 | |
| I | 35023 | 9.1% |
| E | 34986 | 9.1% |
| O | 34666 | 9.0% |
| T | 34137 | 8.8% |
| R | 32467 | 8.4% |
| L | 27171 | 7.0% |
| A | 23171 | 6.0% |
| N | 20988 | 5.4% |
| 13282 | 3.4% | |
| Other values (39) | 90915 |
None
| Value | Count | Frequency (%) |
| Ã | 36 |
| Distinct | 1221 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 920 |
| Missing (%) | 2.3% |
| Memory size | 2.5 MiB |
| HOMME | |
|---|---|
| FEMME | |
| UNISEXE ADULTE | 1471 |
| SHOES - LOW (NON FOOTBALL) | 851 |
| MEN | 845 |
| Other values (1216) |
Length
| Max length | 34 |
|---|---|
| Median length | 31 |
| Mean length | 9.292693089 |
| Min length | 1 |
Characters and Unicode
| Total characters | 356858 |
|---|---|
| Distinct characters | 63 |
| Distinct categories | 11 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 317 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | HOMME |
|---|---|
| 2nd row | RUBBER BOOTS |
| 3rd row | HOMME |
| 4th row | FEMME |
| 5th row | MEN |
Common Values
| Value | Count | Frequency (%) |
| HOMME | 3906 | 9.9% |
| FEMME | 3470 | 8.8% |
| UNISEXE ADULTE | 1471 | 3.7% |
| SHOES - LOW (NON FOOTBALL) | 851 | 2.2% |
| MEN | 845 | 2.1% |
| GARCON | 764 | 1.9% |
| VESTE | 721 | 1.8% |
| WOMEN | 642 | 1.6% |
| UNISEXE ENFANT | 580 | 1.5% |
| UNISEX | 572 | 1.5% |
| Other values (1211) | 24580 | |
| (Missing) | 920 | 2.3% |
Length
| Value | Count | Frequency (%) |
| homme | 3915 | 6.8% |
| femme | 3473 | 6.0% |
| unisexe | 2262 | 3.9% |
| adulte | 1481 | 2.6% |
| shoes | 1275 | 2.2% |
| unisex | 1067 | 1.9% |
| 1028 | 1.8% | |
| men | 1000 | 1.7% |
| football | 948 | 1.6% |
| veste | 880 | 1.5% |
| Other values (1095) | 40290 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 52712 | |
| S | 33070 | 9.3% |
| T | 26057 | 7.3% |
| A | 22992 | 6.4% |
| O | 22781 | 6.4% |
| M | 22543 | 6.3% |
| N | 21520 | 6.0% |
| 19217 | 5.4% | |
| R | 15557 | 4.4% |
| L | 14477 | 4.1% |
| Other values (53) | 105932 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 327898 | |
| Space Separator | 19217 | 5.4% |
| Dash Punctuation | 2450 | 0.7% |
| Open Punctuation | 2017 | 0.6% |
| Close Punctuation | 1884 | 0.5% |
| Other Punctuation | 1770 | 0.5% |
| Decimal Number | 1523 | 0.4% |
| Lowercase Letter | 92 | < 0.1% |
| Math Symbol | 4 | < 0.1% |
| Modifier Symbol | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 52712 | |
| S | 33070 | |
| T | 26057 | 7.9% |
| A | 22992 | 7.0% |
| O | 22781 | 6.9% |
| M | 22543 | 6.9% |
| N | 21520 | 6.6% |
| R | 15557 | 4.7% |
| L | 14477 | 4.4% |
| I | 14215 | 4.3% |
| Other values (17) | 81974 |
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 14 | |
| r | 12 | |
| a | 11 | |
| d | 10 | |
| e | 10 | |
| o | 10 | |
| b | 6 | |
| é | 6 | |
| n | 6 | |
| c | 3 | 3.3% |
| Other values (4) | 4 | 4.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 639 | |
| 0 | 309 | |
| 5 | 257 | |
| 2 | 118 | 7.7% |
| 9 | 91 | 6.0% |
| 4 | 43 | 2.8% |
| 3 | 39 | 2.6% |
| 6 | 20 | 1.3% |
| 7 | 5 | 0.3% |
| 8 | 2 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1527 | |
| & | 113 | 6.4% |
| ' | 106 | 6.0% |
| , | 22 | 1.2% |
| . | 2 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 19217 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2450 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2017 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1884 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 4 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 2 |
Control
| Value | Count | Frequency (%) |
| | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 327990 | |
| Common | 28868 | 8.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 52712 | |
| S | 33070 | |
| T | 26057 | 7.9% |
| A | 22992 | 7.0% |
| O | 22781 | 6.9% |
| M | 22543 | 6.9% |
| N | 21520 | 6.6% |
| R | 15557 | 4.7% |
| L | 14477 | 4.4% |
| I | 14215 | 4.3% |
| Other values (31) | 82066 |
Common
| Value | Count | Frequency (%) |
| 19217 | ||
| - | 2450 | 8.5% |
| ( | 2017 | 7.0% |
| ) | 1884 | 6.5% |
| / | 1527 | 5.3% |
| 1 | 639 | 2.2% |
| 0 | 309 | 1.1% |
| 5 | 257 | 0.9% |
| 2 | 118 | 0.4% |
| & | 113 | 0.4% |
| Other values (12) | 337 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 356847 | |
| None | 11 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 52712 | |
| S | 33070 | 9.3% |
| T | 26057 | 7.3% |
| A | 22992 | 6.4% |
| O | 22781 | 6.4% |
| M | 22543 | 6.3% |
| N | 21520 | 6.0% |
| 19217 | 5.4% | |
| R | 15557 | 4.4% |
| L | 14477 | 4.1% |
| Other values (49) | 105921 |
None
| Value | Count | Frequency (%) |
| é | 6 | |
| Ê | 2 | 18.2% |
| ´ | 2 | 18.2% |
| | 1 | 9.1% |
| Distinct | 4076 |
|---|---|
| Distinct (%) | 13.8% |
| Missing | 9700 |
| Missing (%) | 24.7% |
| Memory size | 2.2 MiB |
| 09-SHOES (LOW) | 1436 |
|---|---|
| ADULT MALE | 1125 |
| ADULT FEMALE | 950 |
| 30-JERSEY | 339 |
| ADULT UNISEX | 321 |
| Other values (4071) |
Length
| Max length | 35 |
|---|---|
| Median length | 29 |
| Mean length | 11.53443387 |
| Min length | 1 |
Characters and Unicode
| Total characters | 341673 |
|---|---|
| Distinct characters | 75 |
| Distinct categories | 12 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 2606 ? |
|---|---|
| Unique (%) | 8.8% |
Sample
| 1st row | 09-SHOES (LOW) |
|---|---|
| 2nd row | BOOTS |
| 3rd row | N1FARROW |
| 4th row | 21-TANK |
| 5th row | T-SHIRT |
Common Values
| Value | Count | Frequency (%) |
| 09-SHOES (LOW) | 1436 | 3.7% |
| ADULT MALE | 1125 | 2.9% |
| ADULT FEMALE | 950 | 2.4% |
| 30-JERSEY | 339 | 0.9% |
| ADULT UNISEX | 321 | 0.8% |
| MANCHE LONGUE | 306 | 0.8% |
| TEE SHIRT MC | 279 | 0.7% |
| DENIM PANTS | 276 | 0.7% |
| 44-PANTS (1/1) | 266 | 0.7% |
| PANTALON | 242 | 0.6% |
| Other values (4066) | 24082 | |
| (Missing) | 9700 |
Length
| Value | Count | Frequency (%) |
| adult | 2396 | 4.4% |
| low | 1695 | 3.1% |
| 09-shoes | 1436 | 2.6% |
| male | 1352 | 2.5% |
| female | 1063 | 1.9% |
| de | 757 | 1.4% |
| kids | 733 | 1.3% |
| short | 724 | 1.3% |
| unisex | 680 | 1.2% |
| top | 662 | 1.2% |
| Other values (3557) | 43015 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 35249 | 10.3% |
| S | 28516 | 8.3% |
| T | 25151 | 7.4% |
| 25121 | 7.4% | |
| A | 24031 | 7.0% |
| O | 20044 | 5.9% |
| L | 19072 | 5.6% |
| R | 16000 | 4.7% |
| I | 15638 | 4.6% |
| N | 13097 | 3.8% |
| Other values (65) | 119754 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 286481 | |
| Space Separator | 25124 | 7.4% |
| Decimal Number | 15849 | 4.6% |
| Dash Punctuation | 6317 | 1.8% |
| Open Punctuation | 2223 | 0.7% |
| Close Punctuation | 2221 | 0.7% |
| Other Punctuation | 2120 | 0.6% |
| Lowercase Letter | 1248 | 0.4% |
| Math Symbol | 63 | < 0.1% |
| Modifier Symbol | 17 | < 0.1% |
| Other values (2) | 10 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 35249 | |
| S | 28516 | 10.0% |
| T | 25151 | 8.8% |
| A | 24031 | 8.4% |
| O | 20044 | 7.0% |
| L | 19072 | 6.7% |
| R | 16000 | 5.6% |
| I | 15638 | 5.5% |
| N | 13097 | 4.6% |
| C | 10597 | 3.7% |
| Other values (20) | 79086 |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 155 | |
| a | 145 | |
| s | 134 | |
| l | 127 | |
| p | 93 | |
| r | 91 | |
| e | 88 | 7.1% |
| d | 72 | 5.8% |
| y | 59 | 4.7% |
| b | 58 | 4.6% |
| Other values (8) | 226 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3383 | |
| 1 | 2646 | |
| 9 | 1927 | |
| 3 | 1814 | |
| 4 | 1811 | |
| 2 | 1555 | |
| 5 | 1043 | 6.6% |
| 7 | 775 | 4.9% |
| 6 | 471 | 3.0% |
| 8 | 424 | 2.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1502 | |
| % | 173 | 8.2% |
| . | 141 | 6.7% |
| , | 126 | 5.9% |
| & | 116 | 5.5% |
| ' | 49 | 2.3% |
| " | 7 | 0.3% |
| ? | 6 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 25121 | ||
| 3 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6317 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2223 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2221 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 63 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 17 |
Other Symbol
| Value | Count | Frequency (%) |
| ® | 9 |
Control
| Value | Count | Frequency (%) |
| | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 287729 | |
| Common | 53944 | 15.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 35249 | |
| S | 28516 | 9.9% |
| T | 25151 | 8.7% |
| A | 24031 | 8.4% |
| O | 20044 | 7.0% |
| L | 19072 | 6.6% |
| R | 16000 | 5.6% |
| I | 15638 | 5.4% |
| N | 13097 | 4.6% |
| C | 10597 | 3.7% |
| Other values (38) | 80334 |
Common
| Value | Count | Frequency (%) |
| 25121 | ||
| - | 6317 | 11.7% |
| 0 | 3383 | 6.3% |
| 1 | 2646 | 4.9% |
| ( | 2223 | 4.1% |
| ) | 2221 | 4.1% |
| 9 | 1927 | 3.6% |
| 3 | 1814 | 3.4% |
| 4 | 1811 | 3.4% |
| 2 | 1555 | 2.9% |
| Other values (17) | 4926 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 341623 | |
| None | 50 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 35249 | 10.3% |
| S | 28516 | 8.3% |
| T | 25151 | 7.4% |
| 25121 | 7.4% | |
| A | 24031 | 7.0% |
| O | 20044 | 5.9% |
| L | 19072 | 5.6% |
| R | 16000 | 4.7% |
| I | 15638 | 4.6% |
| N | 13097 | 3.8% |
| Other values (57) | 119704 |
None
| Value | Count | Frequency (%) |
| ´ | 17 | |
| É | 14 | |
| ® | 9 | |
| 3 | 6.0% | |
| Ã | 3 | 6.0% |
| Ê | 2 | 4.0% |
| Î | 1 | 2.0% |
| | 1 | 2.0% |
| Distinct | 134 |
|---|---|
| Distinct (%) | 8.6% |
| Missing | 37770 |
| Missing (%) | 96.1% |
| Memory size | 1.2 MiB |
| VETEMENT | |
|---|---|
| SWI | |
| SNO | 79 |
| T-SHIRT | 69 |
| MATERIEL RANDONNEE | 50 |
| Other values (129) |
Length
| Max length | 33 |
|---|---|
| Median length | 28 |
| Mean length | 7.914948454 |
| Min length | 2 |
Characters and Unicode
| Total characters | 12284 |
|---|---|
| Distinct characters | 61 |
| Distinct categories | 9 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 58 ? |
|---|---|
| Unique (%) | 3.7% |
Sample
| 1st row | MATERIEL RANDONNEE |
|---|---|
| 2nd row | SNO |
| 3rd row | MID CUT DETENTE |
| 4th row | JUNIOR |
| 5th row | FERMANT |
Common Values
| Value | Count | Frequency (%) |
| VETEMENT | 413 | 1.1% |
| SWI | 131 | 0.3% |
| SNO | 79 | 0.2% |
| T-SHIRT | 69 | 0.2% |
| MATERIEL RANDONNEE | 50 | 0.1% |
| SAC A DOS | 47 | 0.1% |
| FERMANT | 47 | 0.1% |
| MANCHES COURTES | 35 | 0.1% |
| CHAUSSURE | 34 | 0.1% |
| ACCESSOIRES | 34 | 0.1% |
| Other values (124) | 613 | 1.6% |
| (Missing) | 37770 |
Length
| Value | Count | Frequency (%) |
| vetement | 413 | |
| swi | 131 | 6.6% |
| sno | 79 | 4.0% |
| t-shirt | 76 | 3.8% |
| manches | 69 | 3.5% |
| sac | 67 | 3.4% |
| materiel | 50 | 2.5% |
| randonnee | 50 | 2.5% |
| a | 48 | 2.4% |
| dos | 47 | 2.4% |
| Other values (166) | 960 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 2206 | |
| T | 1419 | |
| N | 996 | 8.1% |
| S | 965 | 7.9% |
| M | 704 | 5.7% |
| A | 697 | 5.7% |
| R | 518 | 4.2% |
| C | 499 | 4.1% |
| V | 486 | 4.0% |
| I | 485 | 3.9% |
| Other values (51) | 3309 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 11326 | |
| Space Separator | 438 | 3.6% |
| Decimal Number | 267 | 2.2% |
| Lowercase Letter | 126 | 1.0% |
| Dash Punctuation | 110 | 0.9% |
| Other Punctuation | 14 | 0.1% |
| Math Symbol | 1 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 2206 | |
| T | 1419 | |
| N | 996 | |
| S | 965 | |
| M | 704 | 6.2% |
| A | 697 | 6.2% |
| R | 518 | 4.6% |
| C | 499 | 4.4% |
| V | 486 | 4.3% |
| I | 485 | 4.3% |
| Other values (16) | 2351 |
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 14 | |
| i | 13 | |
| t | 13 | |
| e | 12 | |
| u | 11 | |
| a | 8 | 6.3% |
| h | 8 | 6.3% |
| n | 7 | 5.6% |
| c | 7 | 5.6% |
| p | 6 | 4.8% |
| Other values (9) | 27 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 66 | |
| 0 | 34 | |
| 4 | 32 | |
| 8 | 29 | |
| 2 | 28 | |
| 6 | 27 | |
| 9 | 24 | 9.0% |
| 3 | 18 | 6.7% |
| 5 | 8 | 3.0% |
| 7 | 1 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| 438 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 110 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 14 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11452 | |
| Common | 832 | 6.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 2206 | |
| T | 1419 | |
| N | 996 | |
| S | 965 | 8.4% |
| M | 704 | 6.1% |
| A | 697 | 6.1% |
| R | 518 | 4.5% |
| C | 499 | 4.4% |
| V | 486 | 4.2% |
| I | 485 | 4.2% |
| Other values (35) | 2477 |
Common
| Value | Count | Frequency (%) |
| 438 | ||
| - | 110 | 13.2% |
| 1 | 66 | 7.9% |
| 0 | 34 | 4.1% |
| 4 | 32 | 3.8% |
| 8 | 29 | 3.5% |
| 2 | 28 | 3.4% |
| 6 | 27 | 3.2% |
| 9 | 24 | 2.9% |
| 3 | 18 | 2.2% |
| Other values (6) | 26 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12284 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 2206 | |
| T | 1419 | |
| N | 996 | 8.1% |
| S | 965 | 7.9% |
| M | 704 | 5.7% |
| A | 697 | 5.7% |
| R | 518 | 4.2% |
| C | 499 | 4.1% |
| V | 486 | 4.0% |
| I | 485 | 3.9% |
| Other values (51) | 3309 |
| Distinct | 263 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 14414 |
| Missing (%) | 36.7% |
| Memory size | 307.3 KiB |
| Minimum | 2000-01-01 00:00:00 |
|---|---|
| Maximum | 2021-05-25 00:00:00 |
| Distinct | 110 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 17318 |
| Missing (%) | 44.0% |
| Memory size | 307.3 KiB |
| Minimum | 2017-05-31 00:00:00 |
|---|---|
| Maximum | 2099-01-01 00:00:00 |
length
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWEDZEROS| Distinct | 243 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.591936397 |
| Minimum | 0 |
|---|---|
| Maximum | 10000 |
| Zeros | 38334 |
| Zeros (%) | 97.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 307.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 10000 |
| Range | 10000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 106.0555133 |
|---|---|
| Coefficient of variation (CV) | 18.96579392 |
| Kurtosis | 4414.817351 |
| Mean | 5.591936397 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 55.40474886 |
| Sum | 219886.123 |
| Variance | 11247.7719 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 38334 | |
| 0.5 | 72 | 0.2% |
| 35 | 51 | 0.1% |
| 31.5 | 50 | 0.1% |
| 48 | 44 | 0.1% |
| 34.5 | 38 | 0.1% |
| 61 | 28 | 0.1% |
| 45 | 21 | 0.1% |
| 22 | 20 | 0.1% |
| 27 | 16 | < 0.1% |
| Other values (233) | 648 | 1.6% |
| Value | Count | Frequency (%) |
| 0 | 38334 | |
| 0.01 | 4 | < 0.1% |
| 0.011 | 4 | < 0.1% |
| 0.154 | 1 | < 0.1% |
| 0.163 | 1 | < 0.1% |
| 0.167 | 1 | < 0.1% |
| 0.2 | 1 | < 0.1% |
| 0.201 | 1 | < 0.1% |
| 0.222 | 1 | < 0.1% |
| 0.226 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 10000 | 2 | < 0.1% |
| 5500 | 1 | < 0.1% |
| 4000 | 1 | < 0.1% |
| 2740 | 10 | |
| 2600 | 1 | < 0.1% |
| 2500 | 1 | < 0.1% |
| 2230 | 1 | < 0.1% |
| 2130 | 1 | < 0.1% |
| 2030 | 1 | < 0.1% |
| 1800 | 2 | < 0.1% |
width
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWEDZEROS| Distinct | 257 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.538041122 |
| Minimum | 0 |
|---|---|
| Maximum | 10000 |
| Zeros | 38235 |
| Zeros (%) | 97.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 307.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 10000 |
| Range | 10000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 68.0986718 |
|---|---|
| Coefficient of variation (CV) | 19.24756368 |
| Kurtosis | 12023.37956 |
| Mean | 3.538041122 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 88.89368975 |
| Sum | 139122.853 |
| Variance | 4637.429101 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 38235 | |
| 0.5 | 71 | 0.2% |
| 27 | 71 | 0.2% |
| 16.5 | 58 | 0.1% |
| 152 | 44 | 0.1% |
| 20 | 37 | 0.1% |
| 30 | 36 | 0.1% |
| 163 | 33 | 0.1% |
| 28 | 22 | 0.1% |
| 25 | 18 | < 0.1% |
| Other values (247) | 697 | 1.8% |
| Value | Count | Frequency (%) |
| 0 | 38235 | |
| 0.001 | 2 | < 0.1% |
| 0.002 | 1 | < 0.1% |
| 0.004 | 2 | < 0.1% |
| 0.006 | 1 | < 0.1% |
| 0.008 | 1 | < 0.1% |
| 0.01 | 10 | < 0.1% |
| 0.011 | 5 | < 0.1% |
| 0.012 | 4 | < 0.1% |
| 0.014 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 10000 | 1 | < 0.1% |
| 3000 | 1 | < 0.1% |
| 1800 | 2 | < 0.1% |
| 1520 | 10 | |
| 1220 | 1 | < 0.1% |
| 1160 | 1 | < 0.1% |
| 1050 | 9 | |
| 1010 | 2 | < 0.1% |
| 990 | 1 | < 0.1% |
| 970 | 5 |
height
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWEDZEROS| Distinct | 168 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.621937567 |
| Minimum | 0 |
|---|---|
| Maximum | 2000 |
| Zeros | 38279 |
| Zeros (%) | 97.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 307.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 2000 |
| Range | 2000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 28.45091296 |
|---|---|
| Coefficient of variation (CV) | 17.54131204 |
| Kurtosis | 1384.818326 |
| Mean | 1.621937567 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 32.8271002 |
| Sum | 63777.829 |
| Variance | 809.4544484 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 38279 | |
| 5 | 85 | 0.2% |
| 1 | 82 | 0.2% |
| 5.76 | 71 | 0.2% |
| 10.5 | 48 | 0.1% |
| 62 | 44 | 0.1% |
| 2 | 33 | 0.1% |
| 1.5 | 33 | 0.1% |
| 12 | 32 | 0.1% |
| 4 | 32 | 0.1% |
| Other values (158) | 583 | 1.5% |
| Value | Count | Frequency (%) |
| 0 | 38279 | |
| 0.005 | 3 | < 0.1% |
| 0.006 | 4 | < 0.1% |
| 0.01 | 4 | < 0.1% |
| 0.015 | 14 | < 0.1% |
| 0.02 | 4 | < 0.1% |
| 0.051 | 1 | < 0.1% |
| 0.087 | 4 | < 0.1% |
| 0.1 | 1 | < 0.1% |
| 0.125 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 2000 | 1 | < 0.1% |
| 1330 | 1 | < 0.1% |
| 1130 | 2 | < 0.1% |
| 950 | 8 | |
| 920 | 1 | < 0.1% |
| 860 | 2 | < 0.1% |
| 820 | 1 | < 0.1% |
| 810 | 1 | < 0.1% |
| 800 | 1 | < 0.1% |
| 760 | 10 |
| Distinct | 4399 |
|---|---|
| Distinct (%) | 16.5% |
| Missing | 12645 |
| Missing (%) | 32.2% |
| Memory size | 1.9 MiB |
| 10776 | 1756 |
|---|---|
| 095A | 1021 |
| 001 | 916 |
| 010 | 597 |
| 001A | 427 |
| Other values (4394) |
Length
| Max length | 15 |
|---|---|
| Median length | 10 |
| Mean length | 3.611238145 |
| Min length | 1 |
Characters and Unicode
| Total characters | 96337 |
|---|---|
| Distinct characters | 37 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2507 ? |
|---|---|
| Unique (%) | 9.4% |
Sample
| 1st row | AC4H |
|---|---|
| 2nd row | AE2W |
| 3rd row | 001A |
| 4th row | A433 |
| 5th row | BLA |
Common Values
| Value | Count | Frequency (%) |
| 10776 | 1756 | 4.5% |
| 095A | 1021 | 2.6% |
| 001 | 916 | 2.3% |
| 010 | 597 | 1.5% |
| 001A | 427 | 1.1% |
| BDS | 405 | 1.0% |
| BLA | 391 | 1.0% |
| 000 | 382 | 1.0% |
| A0QM | 381 | 1.0% |
| 01F7 | 372 | 0.9% |
| Other values (4389) | 20029 | |
| (Missing) | 12645 |
Length
| Value | Count | Frequency (%) |
| 10776 | 1756 | 6.6% |
| 095a | 1021 | 3.8% |
| 001 | 916 | 3.4% |
| 010 | 597 | 2.2% |
| 001a | 427 | 1.6% |
| bds | 405 | 1.5% |
| bla | 391 | 1.5% |
| 000 | 382 | 1.4% |
| a0qm | 381 | 1.4% |
| 01f7 | 372 | 1.4% |
| Other values (4389) | 20031 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 23238 | |
| 1 | 11973 | |
| 7 | 7078 | 7.3% |
| A | 6083 | 6.3% |
| 5 | 4868 | 5.1% |
| 2 | 4475 | 4.6% |
| 6 | 4351 | 4.5% |
| 9 | 4198 | 4.4% |
| 3 | 4075 | 4.2% |
| 4 | 3263 | 3.4% |
| Other values (27) | 22735 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 70063 | |
| Uppercase Letter | 26272 | 27.3% |
| Space Separator | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 6083 | |
| B | 3141 | |
| D | 1954 | 7.4% |
| K | 1188 | 4.5% |
| E | 1138 | 4.3% |
| S | 1064 | 4.0% |
| L | 1029 | 3.9% |
| W | 1000 | 3.8% |
| M | 959 | 3.7% |
| F | 921 | 3.5% |
| Other values (16) | 7795 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 23238 | |
| 1 | 11973 | |
| 7 | 7078 | 10.1% |
| 5 | 4868 | 6.9% |
| 2 | 4475 | 6.4% |
| 6 | 4351 | 6.2% |
| 9 | 4198 | 6.0% |
| 3 | 4075 | 5.8% |
| 4 | 3263 | 4.7% |
| 8 | 2544 | 3.6% |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 70065 | |
| Latin | 26272 | 27.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 6083 | |
| B | 3141 | |
| D | 1954 | 7.4% |
| K | 1188 | 4.5% |
| E | 1138 | 4.3% |
| S | 1064 | 4.0% |
| L | 1029 | 3.9% |
| W | 1000 | 3.8% |
| M | 959 | 3.7% |
| F | 921 | 3.5% |
| Other values (16) | 7795 |
Common
| Value | Count | Frequency (%) |
| 0 | 23238 | |
| 1 | 11973 | |
| 7 | 7078 | 10.1% |
| 5 | 4868 | 6.9% |
| 2 | 4475 | 6.4% |
| 6 | 4351 | 6.2% |
| 9 | 4198 | 6.0% |
| 3 | 4075 | 5.8% |
| 4 | 3263 | 4.7% |
| 8 | 2544 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 96337 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 23238 | |
| 1 | 11973 | |
| 7 | 7078 | 7.3% |
| A | 6083 | 6.3% |
| 5 | 4868 | 5.1% |
| 2 | 4475 | 4.6% |
| 6 | 4351 | 4.5% |
| 9 | 4198 | 4.4% |
| 3 | 4075 | 4.2% |
| 4 | 3263 | 3.4% |
| Other values (27) | 22735 |
| Distinct | 12907 |
|---|---|
| Distinct (%) | 32.8% |
| Missing | 14 |
| Missing (%) | < 0.1% |
| Memory size | 2.6 MiB |
| BLACK | 2324 |
|---|---|
| NS | 2039 |
| NOIR | 693 |
| 095A BLACK | 446 |
| BLANC | 333 |
| Other values (12902) |
Length
| Max length | 35 |
|---|---|
| Median length | 27 |
| Mean length | 13.50208609 |
| Min length | 1 |
Characters and Unicode
| Total characters | 530740 |
|---|---|
| Distinct characters | 66 |
| Distinct categories | 12 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 8867 ? |
|---|---|
| Unique (%) | 22.6% |
Sample
| 1st row | AC4H TRUGR7/TRUGR7/FTWWHT |
|---|---|
| 2nd row | NOIR |
| 3rd row | DEEP MARINE |
| 4th row | 001A WHITE/BLACK |
| 5th row | DARK PINK-BLACK |
Common Values
| Value | Count | Frequency (%) |
| BLACK | 2324 | 5.9% |
| NS | 2039 | 5.2% |
| NOIR | 693 | 1.8% |
| 095A BLACK | 446 | 1.1% |
| BLANC | 333 | 0.8% |
| 095A BLACK/WHITE | 307 | 0.8% |
| WHITE | 292 | 0.7% |
| BLA BLACK | 265 | 0.7% |
| 000 ONECOLOR | 218 | 0.6% |
| 0019 BLACK | 196 | 0.5% |
| Other values (12897) | 32195 |
Length
| Value | Count | Frequency (%) |
| black | 7596 | 9.4% |
| ns | 2105 | 2.6% |
| white | 1814 | 2.2% |
| blue | 1568 | 1.9% |
| noir | 1146 | 1.4% |
| 1046 | 1.3% | |
| navy | 1036 | 1.3% |
| grey | 1033 | 1.3% |
| 095a | 1021 | 1.3% |
| bleu | 665 | 0.8% |
| Other values (12864) | 61944 |
Most occurring characters
| Value | Count | Frequency (%) |
| 49005 | 9.2% | |
| A | 40237 | 7.6% |
| E | 35689 | 6.7% |
| L | 33289 | 6.3% |
| R | 26216 | 4.9% |
| C | 26027 | 4.9% |
| B | 25823 | 4.9% |
| I | 22365 | 4.2% |
| T | 21708 | 4.1% |
| N | 21610 | 4.1% |
| Other values (56) | 228771 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 402923 | |
| Decimal Number | 59722 | 11.3% |
| Space Separator | 49015 | 9.2% |
| Other Punctuation | 15687 | 3.0% |
| Dash Punctuation | 3220 | 0.6% |
| Connector Punctuation | 74 | < 0.1% |
| Math Symbol | 29 | < 0.1% |
| Open Punctuation | 29 | < 0.1% |
| Close Punctuation | 29 | < 0.1% |
| Lowercase Letter | 9 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 40237 | 10.0% |
| E | 35689 | 8.9% |
| L | 33289 | 8.3% |
| R | 26216 | 6.5% |
| C | 26027 | 6.5% |
| B | 25823 | 6.4% |
| I | 22365 | 5.6% |
| T | 21708 | 5.4% |
| N | 21610 | 5.4% |
| K | 19694 | 4.9% |
| Other values (19) | 130265 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 18549 | |
| 1 | 9630 | |
| 5 | 5330 | 8.9% |
| 9 | 4654 | 7.8% |
| 2 | 4325 | 7.2% |
| 3 | 4062 | 6.8% |
| 7 | 3965 | 6.6% |
| 4 | 3435 | 5.8% |
| 6 | 3094 | 5.2% |
| 8 | 2678 | 4.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 15158 | |
| . | 341 | 2.2% |
| , | 157 | 1.0% |
| & | 19 | 0.1% |
| : | 5 | < 0.1% |
| ' | 4 | < 0.1% |
| # | 1 | < 0.1% |
| * | 1 | < 0.1% |
| ! | 1 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| p | 1 | |
| h | 1 | |
| a | 1 | |
| l | 1 | |
| t | 1 | |
| r | 1 | |
| e | 1 | |
| y | 1 | |
| s | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 49005 | ||
| 10 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3220 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 74 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 29 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 29 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 29 |
Currency Symbol
| Value | Count | Frequency (%) |
| £ | 2 |
Other Symbol
| Value | Count | Frequency (%) |
| © | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 402932 | |
| Common | 127808 | 24.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 40237 | 10.0% |
| E | 35689 | 8.9% |
| L | 33289 | 8.3% |
| R | 26216 | 6.5% |
| C | 26027 | 6.5% |
| B | 25823 | 6.4% |
| I | 22365 | 5.6% |
| T | 21708 | 5.4% |
| N | 21610 | 5.4% |
| K | 19694 | 4.9% |
| Other values (28) | 130274 |
Common
| Value | Count | Frequency (%) |
| 49005 | ||
| 0 | 18549 | 14.5% |
| / | 15158 | 11.9% |
| 1 | 9630 | 7.5% |
| 5 | 5330 | 4.2% |
| 9 | 4654 | 3.6% |
| 2 | 4325 | 3.4% |
| 3 | 4062 | 3.2% |
| 7 | 3965 | 3.1% |
| 4 | 3435 | 2.7% |
| Other values (18) | 9695 | 7.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 530677 | |
| None | 63 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 49005 | 9.2% | |
| A | 40237 | 7.6% |
| E | 35689 | 6.7% |
| L | 33289 | 6.3% |
| R | 26216 | 4.9% |
| C | 26027 | 4.9% |
| B | 25823 | 4.9% |
| I | 22365 | 4.2% |
| T | 21708 | 4.1% |
| N | 21610 | 4.1% |
| Other values (50) | 228708 |
None
| Value | Count | Frequency (%) |
| Ç | 43 | |
| 10 | 15.9% | |
| É | 6 | 9.5% |
| £ | 2 | 3.2% |
| Ã | 1 | 1.6% |
| © | 1 | 1.6% |
| Distinct | 15 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 19630 |
| Missing (%) | 49.9% |
| Memory size | 1.7 MiB |
| HO | |
|---|---|
| FE | |
| UN | |
| GA | |
| UE | |
| Other values (10) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 39384 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | FE |
|---|---|
| 2nd row | FE |
| 3rd row | HO |
| 4th row | HO |
| 5th row | UN |
Common Values
| Value | Count | Frequency (%) |
| HO | 6583 | 16.7% |
| FE | 5497 | 14.0% |
| UN | 3759 | 9.6% |
| GA | 1137 | 2.9% |
| UE | 1046 | 2.7% |
| UA | 823 | 2.1% |
| FI | 688 | 1.7% |
| BG | 64 | 0.2% |
| UB | 33 | 0.1% |
| BF | 25 | 0.1% |
| Other values (5) | 37 | 0.1% |
| (Missing) | 19630 |
Length
| Value | Count | Frequency (%) |
| ho | 6583 | |
| fe | 5497 | |
| un | 3759 | |
| ga | 1137 | 5.8% |
| ue | 1046 | 5.3% |
| ua | 823 | 4.2% |
| fi | 688 | 3.5% |
| bg | 64 | 0.3% |
| ub | 33 | 0.2% |
| bf | 25 | 0.1% |
| Other values (5) | 37 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| H | 6583 | |
| O | 6583 | |
| E | 6569 | |
| F | 6210 | |
| U | 5673 | |
| N | 3779 | |
| A | 1960 | 5.0% |
| G | 1201 | 3.0% |
| I | 688 | 1.7% |
| B | 125 | 0.3% |
| Other values (2) | 13 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 39384 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 6583 | |
| O | 6583 | |
| E | 6569 | |
| F | 6210 | |
| U | 5673 | |
| N | 3779 | |
| A | 1960 | 5.0% |
| G | 1201 | 3.0% |
| I | 688 | 1.7% |
| B | 125 | 0.3% |
| Other values (2) | 13 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 39384 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| H | 6583 | |
| O | 6583 | |
| E | 6569 | |
| F | 6210 | |
| U | 5673 | |
| N | 3779 | |
| A | 1960 | 5.0% |
| G | 1201 | 3.0% |
| I | 688 | 1.7% |
| B | 125 | 0.3% |
| Other values (2) | 13 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 39384 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| H | 6583 | |
| O | 6583 | |
| E | 6569 | |
| F | 6210 | |
| U | 5673 | |
| N | 3779 | |
| A | 1960 | 5.0% |
| G | 1201 | 3.0% |
| I | 688 | 1.7% |
| B | 125 | 0.3% |
| Other values (2) | 13 | < 0.1% |
| Distinct | 74 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 14402 |
| Missing (%) | 36.6% |
| Memory size | 1.8 MiB |
| CN | |
|---|---|
| FR | |
| VN | |
| BD | |
| DK | 1058 |
| Other values (69) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 49840 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | CN |
|---|---|
| 2nd row | BD |
| 3rd row | CN |
| 4th row | CN |
| 5th row | NL |
Common Values
| Value | Count | Frequency (%) |
| CN | 6437 | |
| FR | 3465 | 8.8% |
| VN | 1772 | 4.5% |
| BD | 1563 | 4.0% |
| DK | 1058 | 2.7% |
| IN | 1022 | 2.6% |
| FI | 952 | 2.4% |
| TR | 804 | 2.0% |
| KH | 796 | 2.0% |
| NL | 633 | 1.6% |
| Other values (64) | 6418 | |
| (Missing) | 14402 |
Length
| Value | Count | Frequency (%) |
| cn | 6437 | |
| fr | 3465 | |
| vn | 1772 | 7.1% |
| bd | 1563 | 6.3% |
| dk | 1058 | 4.2% |
| in | 1022 | 4.1% |
| fi | 952 | 3.8% |
| tr | 804 | 3.2% |
| kh | 796 | 3.2% |
| nl | 633 | 2.5% |
| Other values (64) | 6418 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 10310 | |
| C | 6725 | |
| R | 4441 | |
| F | 4417 | |
| D | 3553 | 7.1% |
| T | 2923 | 5.9% |
| I | 2795 | 5.6% |
| K | 2665 | 5.3% |
| B | 1960 | 3.9% |
| V | 1791 | 3.6% |
| Other values (15) | 8260 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 49840 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 10310 | |
| C | 6725 | |
| R | 4441 | |
| F | 4417 | |
| D | 3553 | 7.1% |
| T | 2923 | 5.9% |
| I | 2795 | 5.6% |
| K | 2665 | 5.3% |
| B | 1960 | 3.9% |
| V | 1791 | 3.6% |
| Other values (15) | 8260 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 49840 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 10310 | |
| C | 6725 | |
| R | 4441 | |
| F | 4417 | |
| D | 3553 | 7.1% |
| T | 2923 | 5.9% |
| I | 2795 | 5.6% |
| K | 2665 | 5.3% |
| B | 1960 | 3.9% |
| V | 1791 | 3.6% |
| Other values (15) | 8260 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49840 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 10310 | |
| C | 6725 | |
| R | 4441 | |
| F | 4417 | |
| D | 3553 | 7.1% |
| T | 2923 | 5.9% |
| I | 2795 | 5.6% |
| K | 2665 | 5.3% |
| B | 1960 | 3.9% |
| V | 1791 | 3.6% |
| Other values (15) | 8260 |
| Distinct | 78 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 14402 |
| Missing (%) | 36.6% |
| Memory size | 1.8 MiB |
| CN | |
|---|---|
| FR | |
| VN | |
| BD | |
| IN | |
| Other values (73) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 49840 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | CN |
|---|---|
| 2nd row | BD |
| 3rd row | CN |
| 4th row | CN |
| 5th row | TR |
Common Values
| Value | Count | Frequency (%) |
| CN | 8632 | |
| FR | 2283 | 5.8% |
| VN | 1913 | 4.9% |
| BD | 1868 | 4.8% |
| IN | 1157 | 2.9% |
| TR | 1126 | 2.9% |
| KH | 800 | 2.0% |
| DK | 767 | 2.0% |
| PK | 721 | 1.8% |
| TW | 519 | 1.3% |
| Other values (68) | 5134 | 13.1% |
| (Missing) | 14402 |
Length
| Value | Count | Frequency (%) |
| cn | 8632 | |
| fr | 2283 | 9.2% |
| vn | 1913 | 7.7% |
| bd | 1868 | 7.5% |
| in | 1157 | 4.6% |
| tr | 1126 | 4.5% |
| kh | 800 | 3.2% |
| dk | 767 | 3.1% |
| pk | 721 | 2.9% |
| tw | 519 | 2.1% |
| Other values (68) | 5134 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 12316 | |
| C | 8852 | |
| R | 3583 | 7.2% |
| T | 3284 | 6.6% |
| D | 3211 | 6.4% |
| K | 2749 | 5.5% |
| F | 2297 | 4.6% |
| I | 2041 | 4.1% |
| B | 2019 | 4.1% |
| V | 1936 | 3.9% |
| Other values (15) | 7552 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 49840 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 12316 | |
| C | 8852 | |
| R | 3583 | 7.2% |
| T | 3284 | 6.6% |
| D | 3211 | 6.4% |
| K | 2749 | 5.5% |
| F | 2297 | 4.6% |
| I | 2041 | 4.1% |
| B | 2019 | 4.1% |
| V | 1936 | 3.9% |
| Other values (15) | 7552 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 49840 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 12316 | |
| C | 8852 | |
| R | 3583 | 7.2% |
| T | 3284 | 6.6% |
| D | 3211 | 6.4% |
| K | 2749 | 5.5% |
| F | 2297 | 4.6% |
| I | 2041 | 4.1% |
| B | 2019 | 4.1% |
| V | 1936 | 3.9% |
| Other values (15) | 7552 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49840 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 12316 | |
| C | 8852 | |
| R | 3583 | 7.2% |
| T | 3284 | 6.6% |
| D | 3211 | 6.4% |
| K | 2749 | 5.5% |
| F | 2297 | 4.6% |
| I | 2041 | 4.1% |
| B | 2019 | 4.1% |
| V | 1936 | 3.9% |
| Other values (15) | 7552 |
| Distinct | 52 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 36549 |
| Missing (%) | 92.9% |
| Memory size | 1.3 MiB |
| GLEIZE | |
|---|---|
| NL | |
| BANGLADESH | |
| CHINE | |
| CN | |
| Other values (47) |
Length
| Max length | 21 |
|---|---|
| Median length | 11 |
| Mean length | 5.342589254 |
| Min length | 2 |
Characters and Unicode
| Total characters | 14815 |
|---|---|
| Distinct characters | 35 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 8 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | BANGLADESH |
|---|---|
| 2nd row | CHINE |
| 3rd row | GLEIZE |
| 4th row | MM |
| 5th row | CHINE |
Common Values
| Value | Count | Frequency (%) |
| GLEIZE | 518 | 1.3% |
| NL | 431 | 1.1% |
| BANGLADESH | 250 | 0.6% |
| CHINE | 221 | 0.6% |
| CN | 205 | 0.5% |
| CHINA | 157 | 0.4% |
| ITALIE | 103 | 0.3% |
| NETHERLANDS | 98 | 0.2% |
| TURKEY | 75 | 0.2% |
| FRANCE | 69 | 0.2% |
| Other values (42) | 646 | 1.6% |
| (Missing) | 36549 |
Length
| Value | Count | Frequency (%) |
| gleize | 518 | |
| nl | 431 | |
| bangladesh | 250 | 8.9% |
| chine | 221 | 7.9% |
| cn | 205 | 7.3% |
| china | 157 | 5.6% |
| netherlands | 148 | 5.3% |
| italie | 103 | 3.7% |
| turkey | 75 | 2.7% |
| france | 69 | 2.5% |
| Other values (46) | 625 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 2144 | |
| N | 1978 | |
| I | 1480 | |
| L | 1461 | |
| A | 1287 | |
| G | 838 | 5.7% |
| H | 811 | 5.5% |
| C | 673 | 4.5% |
| Z | 520 | 3.5% |
| D | 478 | 3.2% |
| Other values (25) | 3145 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 14286 | |
| Lowercase Letter | 500 | 3.4% |
| Space Separator | 29 | 0.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 2144 | |
| N | 1978 | |
| I | 1480 | |
| L | 1461 | |
| A | 1287 | |
| G | 838 | 5.9% |
| H | 811 | 5.7% |
| C | 673 | 4.7% |
| Z | 520 | 3.6% |
| D | 478 | 3.3% |
| Other values (15) | 2616 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 100 | |
| h | 50 | |
| r | 50 | |
| l | 50 | |
| a | 50 | |
| n | 50 | |
| d | 50 | |
| s | 50 | |
| t | 50 |
Space Separator
| Value | Count | Frequency (%) |
| 29 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14786 | |
| Common | 29 | 0.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 2144 | |
| N | 1978 | |
| I | 1480 | |
| L | 1461 | |
| A | 1287 | |
| G | 838 | 5.7% |
| H | 811 | 5.5% |
| C | 673 | 4.6% |
| Z | 520 | 3.5% |
| D | 478 | 3.2% |
| Other values (24) | 3116 |
Common
| Value | Count | Frequency (%) |
| 29 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14815 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 2144 | |
| N | 1978 | |
| I | 1480 | |
| L | 1461 | |
| A | 1287 | |
| G | 838 | 5.7% |
| H | 811 | 5.5% |
| C | 673 | 4.5% |
| Z | 520 | 3.5% |
| D | 478 | 3.2% |
| Other values (25) | 3145 |
| Distinct | 180 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 15834 |
| Missing (%) | 40.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20198176.2 |
| Minimum | 20170430 |
|---|---|
| Maximum | 20210412 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 307.3 KiB |
Quantile statistics
| Minimum | 20170430 |
|---|---|
| 5-th percentile | 20190801 |
| Q1 | 20200102 |
| median | 20200313 |
| Q3 | 20200701 |
| 95-th percentile | 20201015 |
| Maximum | 20210412 |
| Range | 39982 |
| Interquartile range (IQR) | 599 |
Descriptive statistics
| Standard deviation | 5816.493126 |
|---|---|
| Coefficient of variation (CV) | 0.0002879712044 |
| Kurtosis | 2.031064588 |
| Mean | 20198176.2 |
| Median Absolute Deviation (MAD) | 388 |
| Skewness | -1.17385975 |
| Sum | 4.744147625 × 1011 |
| Variance | 33831592.29 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20200715 | 1286 | 3.3% |
| 20200122 | 1161 | 3.0% |
| 20191210 | 903 | 2.3% |
| 20200224 | 842 | 2.1% |
| 20200415 | 769 | 2.0% |
| 20200815 | 621 | 1.6% |
| 20200302 | 618 | 1.6% |
| 20200701 | 614 | 1.6% |
| 20200313 | 608 | 1.5% |
| 20181231 | 518 | 1.3% |
| Other values (170) | 15548 | |
| (Missing) | 15834 |
| Value | Count | Frequency (%) |
| 20170430 | 12 | < 0.1% |
| 20180615 | 72 | 0.2% |
| 20180701 | 9 | < 0.1% |
| 20180901 | 507 | |
| 20181231 | 518 | |
| 20190101 | 10 | < 0.1% |
| 20190516 | 6 | < 0.1% |
| 20190601 | 1 | < 0.1% |
| 20190701 | 2 | < 0.1% |
| 20190801 | 125 | 0.3% |
| Value | Count | Frequency (%) |
| 20210412 | 2 | < 0.1% |
| 20210401 | 25 | 0.1% |
| 20210329 | 50 | 0.1% |
| 20210301 | 65 | 0.2% |
| 20210224 | 4 | < 0.1% |
| 20210201 | 237 | |
| 20210130 | 24 | 0.1% |
| 20210115 | 174 | |
| 20210111 | 21 | 0.1% |
| 20210101 | 418 |
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.005551599613 |
| Minimum | 0 |
|---|---|
| Maximum | 3.15 |
| Zeros | 38117 |
| Zeros (%) | 96.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 307.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 3.15 |
| Range | 3.15 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.06018643961 |
|---|---|
| Coefficient of variation (CV) | 10.84127887 |
| Kurtosis | 640.8368092 |
| Mean | 0.005551599613 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 22.62168 |
| Sum | 218.3 |
| Variance | 0.003622407513 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 38117 | |
| 0.08 | 719 | 1.8% |
| 0.17 | 385 | 1.0% |
| 1 | 51 | 0.1% |
| 1.67 | 24 | 0.1% |
| 0.02 | 9 | < 0.1% |
| 0.04 | 8 | < 0.1% |
| 0.06 | 5 | < 0.1% |
| 0.1 | 3 | < 0.1% |
| 3.15 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 38117 | |
| 0.02 | 9 | < 0.1% |
| 0.04 | 8 | < 0.1% |
| 0.06 | 5 | < 0.1% |
| 0.08 | 719 | 1.8% |
| 0.1 | 3 | < 0.1% |
| 0.17 | 385 | 1.0% |
| 1 | 51 | 0.1% |
| 1.67 | 24 | 0.1% |
| 3.15 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3.15 | 1 | < 0.1% |
| 1.67 | 24 | 0.1% |
| 1 | 51 | 0.1% |
| 0.17 | 385 | 1.0% |
| 0.1 | 3 | < 0.1% |
| 0.08 | 719 | 1.8% |
| 0.06 | 5 | < 0.1% |
| 0.04 | 8 | < 0.1% |
| 0.02 | 9 | < 0.1% |
| 0 | 38117 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| 0.0 | |
|---|---|
| 3.15 | 1 |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.000025431 |
| Min length | 3 |
Characters and Unicode
| Total characters | 117967 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 39321 | |
| 3.15 | 1 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 0.0 | 39321 | |
| 3.15 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 78642 | |
| . | 39322 | |
| 3 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 78645 | |
| Other Punctuation | 39322 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 78642 | |
| 3 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 39322 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 117967 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 78642 | |
| . | 39322 | |
| 3 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 117967 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 78642 | |
| . | 39322 | |
| 3 | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| Distinct | 72 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.78093688 |
| Minimum | 0 |
|---|---|
| Maximum | 324 |
| Zeros | 7440 |
| Zeros (%) | 18.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 307.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 10 |
| Maximum | 324 |
| Range | 324 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 9.613947674 |
|---|---|
| Coefficient of variation (CV) | 3.457089495 |
| Kurtosis | 110.0964599 |
| Mean | 2.78093688 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.333001584 |
| Sum | 109352 |
| Variance | 92.42798989 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 27840 | |
| 0 | 7440 | 18.9% |
| 10 | 1153 | 2.9% |
| 6 | 472 | 1.2% |
| 3 | 430 | 1.1% |
| 12 | 262 | 0.7% |
| 50 | 215 | 0.5% |
| 8 | 189 | 0.5% |
| 60 | 148 | 0.4% |
| 40 | 132 | 0.3% |
| Other values (62) | 1041 | 2.6% |
| Value | Count | Frequency (%) |
| 0 | 7440 | 18.9% |
| 1 | 27840 | |
| 2 | 127 | 0.3% |
| 3 | 430 | 1.1% |
| 4 | 10 | < 0.1% |
| 5 | 49 | 0.1% |
| 6 | 472 | 1.2% |
| 8 | 189 | 0.5% |
| 10 | 1153 | 2.9% |
| 11 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 324 | 1 | < 0.1% |
| 262 | 1 | < 0.1% |
| 210 | 2 | |
| 175 | 1 | < 0.1% |
| 162 | 1 | < 0.1% |
| 160 | 1 | < 0.1% |
| 150 | 1 | < 0.1% |
| 144 | 3 | |
| 140 | 4 | |
| 132 | 1 | < 0.1% |
| Distinct | 38 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.42505468 |
| Minimum | 0 |
|---|---|
| Maximum | 5000 |
| Zeros | 17829 |
| Zeros (%) | 45.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 307.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 1 |
| Maximum | 5000 |
| Range | 5000 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 211.6869363 |
|---|---|
| Coefficient of variation (CV) | 20.30559483 |
| Kurtosis | 535.7023781 |
| Mean | 10.42505468 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 23.04697745 |
| Sum | 409934 |
| Variance | 44811.35901 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 19655 | |
| 0 | 17829 | |
| 6 | 556 | 1.4% |
| 3 | 379 | 1.0% |
| 12 | 321 | 0.8% |
| 10 | 175 | 0.4% |
| 8 | 140 | 0.4% |
| 5000 | 68 | 0.2% |
| 5 | 56 | 0.1% |
| 2 | 25 | 0.1% |
| Other values (28) | 118 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 17829 | |
| 1 | 19655 | |
| 2 | 25 | 0.1% |
| 3 | 379 | 1.0% |
| 4 | 15 | < 0.1% |
| 5 | 56 | 0.1% |
| 6 | 556 | 1.4% |
| 7 | 4 | < 0.1% |
| 8 | 140 | 0.4% |
| 10 | 175 | 0.4% |
| Value | Count | Frequency (%) |
| 5000 | 68 | |
| 3000 | 3 | < 0.1% |
| 2500 | 3 | < 0.1% |
| 1500 | 3 | < 0.1% |
| 1000 | 13 | < 0.1% |
| 500 | 1 | < 0.1% |
| 272 | 1 | < 0.1% |
| 250 | 1 | < 0.1% |
| 200 | 2 | < 0.1% |
| 120 | 1 | < 0.1% |
net_weight
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWEDZEROS| Distinct | 551 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.0214366 |
| Minimum | 0 |
|---|---|
| Maximum | 12500 |
| Zeros | 26860 |
| Zeros (%) | 68.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 307.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0.13 |
| 95-th percentile | 1 |
| Maximum | 12500 |
| Range | 12500 |
| Interquartile range (IQR) | 0.13 |
Descriptive statistics
| Standard deviation | 96.46051026 |
|---|---|
| Coefficient of variation (CV) | 19.20974373 |
| Kurtosis | 8707.920133 |
| Mean | 5.0214366 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 77.07187952 |
| Sum | 197452.93 |
| Variance | 9304.63004 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 26860 | |
| 0.5 | 664 | 1.7% |
| 0.3 | 618 | 1.6% |
| 0.35 | 539 | 1.4% |
| 0.1 | 437 | 1.1% |
| 1 | 344 | 0.9% |
| 0.2 | 343 | 0.9% |
| 0.01 | 334 | 0.8% |
| 0.8 | 315 | 0.8% |
| 0.4 | 277 | 0.7% |
| Other values (541) | 8591 | 21.8% |
| Value | Count | Frequency (%) |
| 0 | 26860 | |
| 0.01 | 334 | 0.8% |
| 0.02 | 100 | 0.3% |
| 0.03 | 211 | 0.5% |
| 0.04 | 226 | 0.6% |
| 0.05 | 220 | 0.6% |
| 0.06 | 166 | 0.4% |
| 0.07 | 185 | 0.5% |
| 0.08 | 183 | 0.5% |
| 0.09 | 211 | 0.5% |
| Value | Count | Frequency (%) |
| 12500 | 1 | < 0.1% |
| 8300 | 1 | < 0.1% |
| 4600 | 1 | < 0.1% |
| 1500 | 1 | < 0.1% |
| 1480 | 1 | < 0.1% |
| 1430 | 1 | < 0.1% |
| 1300 | 3 | |
| 1280 | 1 | < 0.1% |
| 1210 | 1 | < 0.1% |
| 1190 | 1 | < 0.1% |
raw_weight
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWEDZEROS| Distinct | 455 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.27072097 |
| Minimum | 0 |
|---|---|
| Maximum | 14000 |
| Zeros | 30149 |
| Zeros (%) | 76.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 307.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0.8 |
| Maximum | 14000 |
| Range | 14000 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 98.86412425 |
|---|---|
| Coefficient of variation (CV) | 43.53864941 |
| Kurtosis | 13565.35116 |
| Mean | 2.27072097 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 108.8920108 |
| Sum | 89289.29 |
| Variance | 9774.115063 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 30149 | |
| 0.5 | 793 | 2.0% |
| 0.15 | 369 | 0.9% |
| 1 | 298 | 0.8% |
| 0.14 | 226 | 0.6% |
| 0.1 | 213 | 0.5% |
| 0.2 | 198 | 0.5% |
| 0.45 | 188 | 0.5% |
| 0.03 | 186 | 0.5% |
| 0.3 | 177 | 0.5% |
| Other values (445) | 6525 | 16.6% |
| Value | Count | Frequency (%) |
| 0 | 30149 | |
| 0.01 | 55 | 0.1% |
| 0.02 | 73 | 0.2% |
| 0.03 | 186 | 0.5% |
| 0.04 | 157 | 0.4% |
| 0.05 | 145 | 0.4% |
| 0.06 | 118 | 0.3% |
| 0.07 | 169 | 0.4% |
| 0.08 | 107 | 0.3% |
| 0.09 | 139 | 0.4% |
| Value | Count | Frequency (%) |
| 14000 | 1 | < 0.1% |
| 10000 | 1 | < 0.1% |
| 7100 | 1 | < 0.1% |
| 1500 | 1 | < 0.1% |
| 1480 | 1 | < 0.1% |
| 1430 | 1 | < 0.1% |
| 1280 | 1 | < 0.1% |
| 1210 | 1 | < 0.1% |
| 1190 | 1 | < 0.1% |
| 1170 | 3 |
| Distinct | 538 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.545065612 |
| Minimum | 0 |
|---|---|
| Maximum | 1010 |
| Zeros | 34170 |
| Zeros (%) | 86.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 307.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 3.6 |
| Maximum | 1010 |
| Range | 1010 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 22.98586909 |
|---|---|
| Coefficient of variation (CV) | 14.87695338 |
| Kurtosis | 1326.062656 |
| Mean | 1.545065612 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 34.39569575 |
| Sum | 60755.07 |
| Variance | 528.3501776 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 34170 | |
| 3.6 | 885 | 2.3% |
| 8.84 | 389 | 1.0% |
| 0.3 | 258 | 0.7% |
| 1 | 254 | 0.6% |
| 0.5 | 182 | 0.5% |
| 5.76 | 174 | 0.4% |
| 0.35 | 124 | 0.3% |
| 72 | 114 | 0.3% |
| 96 | 105 | 0.3% |
| Other values (528) | 2667 | 6.8% |
| Value | Count | Frequency (%) |
| 0 | 34170 | |
| 0.01 | 96 | 0.2% |
| 0.02 | 14 | < 0.1% |
| 0.03 | 11 | < 0.1% |
| 0.04 | 5 | < 0.1% |
| 0.05 | 17 | < 0.1% |
| 0.06 | 12 | < 0.1% |
| 0.07 | 6 | < 0.1% |
| 0.08 | 3 | < 0.1% |
| 0.09 | 10 | < 0.1% |
| Value | Count | Frequency (%) |
| 1010 | 4 | |
| 972.16 | 6 | |
| 924.46 | 1 | < 0.1% |
| 862.22 | 1 | < 0.1% |
| 859.63 | 1 | < 0.1% |
| 847.95 | 3 | < 0.1% |
| 644 | 9 | |
| 620 | 1 | < 0.1% |
| 494.83 | 1 | < 0.1% |
| 317.52 | 1 | < 0.1% |
| Distinct | 812 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
| L | |
|---|---|
| 2XL | |
| TU | |
| 10 | 1368 |
| 36 | 1125 |
| Other values (807) |
Length
| Max length | 13 |
|---|---|
| Median length | 12 |
| Mean length | 2.516046997 |
| Min length | 1 |
Characters and Unicode
| Total characters | 98936 |
|---|---|
| Distinct characters | 56 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 2 ? |
Unique
| Unique | 288 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | 38.5 |
|---|---|
| 2nd row | 36 |
| 3rd row | U |
| 4th row | 2XS |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
| L | 7448 | |
| 2XL | 5648 | 14.4% |
| TU | 4507 | 11.5% |
| 10 | 1368 | 3.5% |
| 36 | 1125 | 2.9% |
| 3XL | 1052 | 2.7% |
| OS | 883 | 2.2% |
| 34 | 823 | 2.1% |
| 28 | 794 | 2.0% |
| 104 | 600 | 1.5% |
| Other values (802) | 15074 |
Length
| Value | Count | Frequency (%) |
| l | 7654 | |
| 2xl | 5648 | 14.0% |
| tu | 4507 | 11.2% |
| 10 | 1403 | 3.5% |
| 36 | 1162 | 2.9% |
| 3xl | 1052 | 2.6% |
| os | 987 | 2.4% |
| 34 | 869 | 2.2% |
| 28 | 811 | 2.0% |
| 35 | 808 | 2.0% |
| Other values (721) | 15436 |
Most occurring characters
| Value | Count | Frequency (%) |
| L | 15405 | |
| 2 | 11386 | |
| 1 | 9189 | 9.3% |
| 3 | 8372 | 8.5% |
| X | 7744 | 7.8% |
| 0 | 5600 | 5.7% |
| U | 5027 | 5.1% |
| T | 4666 | 4.7% |
| 4 | 3446 | 3.5% |
| 5 | 3319 | 3.4% |
| Other values (46) | 24782 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 48128 | |
| Uppercase Letter | 45419 | |
| Other Punctuation | 2425 | 2.5% |
| Space Separator | 1450 | 1.5% |
| Dash Punctuation | 1386 | 1.4% |
| Lowercase Letter | 84 | 0.1% |
| Math Symbol | 30 | < 0.1% |
| Other Number | 14 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 15405 | |
| X | 7744 | |
| U | 5027 | 11.1% |
| T | 4666 | 10.3% |
| S | 2925 | 6.4% |
| O | 2007 | 4.4% |
| E | 1476 | 3.2% |
| N | 1228 | 2.7% |
| I | 983 | 2.2% |
| A | 856 | 1.9% |
| Other values (16) | 3102 | 6.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 30 | |
| c | 11 | 13.1% |
| x | 9 | 10.7% |
| i | 7 | 8.3% |
| e | 7 | 8.3% |
| z | 5 | 6.0% |
| n | 4 | 4.8% |
| s | 3 | 3.6% |
| o | 2 | 2.4% |
| r | 2 | 2.4% |
| Other values (2) | 4 | 4.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 11386 | |
| 1 | 9189 | |
| 3 | 8372 | |
| 0 | 5600 | |
| 4 | 3446 | 7.2% |
| 5 | 3319 | 6.9% |
| 8 | 2397 | 5.0% |
| 6 | 2349 | 4.9% |
| 9 | 1218 | 2.5% |
| 7 | 852 | 1.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1340 | |
| . | 1077 | |
| " | 6 | 0.2% |
| ' | 2 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1450 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1386 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 30 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 14 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 53433 | |
| Latin | 45503 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| L | 15405 | |
| X | 7744 | |
| U | 5027 | 11.0% |
| T | 4666 | 10.3% |
| S | 2925 | 6.4% |
| O | 2007 | 4.4% |
| E | 1476 | 3.2% |
| N | 1228 | 2.7% |
| I | 983 | 2.2% |
| A | 856 | 1.9% |
| Other values (28) | 3186 | 7.0% |
Common
| Value | Count | Frequency (%) |
| 2 | 11386 | |
| 1 | 9189 | |
| 3 | 8372 | |
| 0 | 5600 | |
| 4 | 3446 | 6.4% |
| 5 | 3319 | 6.2% |
| 8 | 2397 | 4.5% |
| 6 | 2349 | 4.4% |
| 1450 | 2.7% | |
| - | 1386 | 2.6% |
| Other values (8) | 4539 | 8.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 98922 | |
| None | 14 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| L | 15405 | |
| 2 | 11386 | |
| 1 | 9189 | 9.3% |
| 3 | 8372 | 8.5% |
| X | 7744 | 7.8% |
| 0 | 5600 | 5.7% |
| U | 5027 | 5.1% |
| T | 4666 | 4.7% |
| 4 | 3446 | 3.5% |
| 5 | 3319 | 3.4% |
| Other values (45) | 24768 |
None
| Value | Count | Frequency (%) |
| ½ | 14 |
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
| HO | |
|---|---|
| FE | |
| UN | |
| GA | |
| UA | |
| Other values (6) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 78644 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | HO |
|---|---|
| 2nd row | FE |
| 3rd row | UA |
| 4th row | FE |
| 5th row | UA |
Common Values
| Value | Count | Frequency (%) |
| HO | 14775 | |
| FE | 10372 | |
| UN | 4098 | 10.4% |
| GA | 4064 | 10.3% |
| UA | 2677 | 6.8% |
| FI | 1798 | 4.6% |
| UE | 744 | 1.9% |
| BG | 443 | 1.1% |
| BF | 246 | 0.6% |
| ND | 75 | 0.2% |
Length
| Value | Count | Frequency (%) |
| ho | 14775 | |
| fe | 10372 | |
| un | 4098 | 10.4% |
| ga | 4064 | 10.3% |
| ua | 2677 | 6.8% |
| fi | 1798 | 4.6% |
| ue | 744 | 1.9% |
| bg | 443 | 1.1% |
| bf | 246 | 0.6% |
| nd | 75 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| H | 14775 | |
| O | 14775 | |
| F | 12416 | |
| E | 11116 | |
| U | 7549 | |
| A | 6741 | |
| G | 4507 | 5.7% |
| N | 4173 | 5.3% |
| I | 1798 | 2.3% |
| B | 719 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 78644 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 14775 | |
| O | 14775 | |
| F | 12416 | |
| E | 11116 | |
| U | 7549 | |
| A | 6741 | |
| G | 4507 | 5.7% |
| N | 4173 | 5.3% |
| I | 1798 | 2.3% |
| B | 719 | 0.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 78644 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| H | 14775 | |
| O | 14775 | |
| F | 12416 | |
| E | 11116 | |
| U | 7549 | |
| A | 6741 | |
| G | 4507 | 5.7% |
| N | 4173 | 5.3% |
| I | 1798 | 2.3% |
| B | 719 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 78644 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| H | 14775 | |
| O | 14775 | |
| F | 12416 | |
| E | 11116 | |
| U | 7549 | |
| A | 6741 | |
| G | 4507 | 5.7% |
| N | 4173 | 5.3% |
| I | 1798 | 2.3% |
| B | 719 | 0.9% |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.2 MiB |
| 2 | |
|---|---|
| 3 | |
| 1 | |
| 7 | 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 39322 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 3 |
| 3rd row | 1 |
| 4th row | 2 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 25246 | |
| 3 | 7064 | 18.0% |
| 1 | 7010 | 17.8% |
| 7 | 2 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 2 | 25246 | |
| 3 | 7064 | 18.0% |
| 1 | 7010 | 17.8% |
| 7 | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 25246 | |
| 3 | 7064 | 18.0% |
| 1 | 7010 | 17.8% |
| 7 | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 39322 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 25246 | |
| 3 | 7064 | 18.0% |
| 1 | 7010 | 17.8% |
| 7 | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 39322 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 25246 | |
| 3 | 7064 | 18.0% |
| 1 | 7010 | 17.8% |
| 7 | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 39322 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 25246 | |
| 3 | 7064 | 18.0% |
| 1 | 7010 | 17.8% |
| 7 | 2 | < 0.1% |
| Distinct | 44 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 43.42068054 |
| Minimum | 0 |
|---|---|
| Maximum | 98 |
| Zeros | 7602 |
| Zeros (%) | 19.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 307.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 46 |
| Q3 | 75 |
| 95-th percentile | 78 |
| Maximum | 98 |
| Range | 98 |
| Interquartile range (IQR) | 74 |
Descriptive statistics
| Standard deviation | 31.8987883 |
|---|---|
| Coefficient of variation (CV) | 0.7346450564 |
| Kurtosis | -1.631273847 |
| Mean | 43.42068054 |
| Median Absolute Deviation (MAD) | 29 |
| Skewness | -0.2755388946 |
| Sum | 1707388 |
| Variance | 1017.532695 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 75 | 13111 | |
| 0 | 7602 | |
| 64 | 3470 | 8.8% |
| 32 | 3152 | 8.0% |
| 1 | 2594 | 6.6% |
| 78 | 2055 | 5.2% |
| 46 | 1542 | 3.9% |
| 15 | 897 | 2.3% |
| 16 | 762 | 1.9% |
| 24 | 523 | 1.3% |
| Other values (34) | 3614 | 9.2% |
| Value | Count | Frequency (%) |
| 0 | 7602 | |
| 1 | 2594 | 6.6% |
| 2 | 45 | 0.1% |
| 3 | 156 | 0.4% |
| 4 | 157 | 0.4% |
| 5 | 14 | < 0.1% |
| 8 | 3 | < 0.1% |
| 14 | 499 | 1.3% |
| 15 | 897 | 2.3% |
| 16 | 762 | 1.9% |
| Value | Count | Frequency (%) |
| 98 | 11 | < 0.1% |
| 90 | 52 | 0.1% |
| 88 | 90 | 0.2% |
| 84 | 7 | < 0.1% |
| 82 | 60 | 0.2% |
| 80 | 99 | 0.3% |
| 78 | 2055 | 5.2% |
| 75 | 13111 | |
| 70 | 2 | < 0.1% |
| 69 | 31 | 0.1% |
| Distinct | 99 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 44.98471594 |
| Minimum | 0 |
|---|---|
| Maximum | 99 |
| Zeros | 72 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 307.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 12 |
| median | 43 |
| Q3 | 73 |
| 95-th percentile | 95 |
| Maximum | 99 |
| Range | 99 |
| Interquartile range (IQR) | 61 |
Descriptive statistics
| Standard deviation | 31.6833359 |
|---|---|
| Coefficient of variation (CV) | 0.7043133482 |
| Kurtosis | -1.313591499 |
| Mean | 44.98471594 |
| Median Absolute Deviation (MAD) | 31 |
| Skewness | 0.2360548141 |
| Sum | 1768889 |
| Variance | 1003.833774 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 4462 | 11.3% |
| 95 | 1819 | 4.6% |
| 90 | 1369 | 3.5% |
| 1 | 1200 | 3.1% |
| 62 | 1163 | 3.0% |
| 27 | 1095 | 2.8% |
| 51 | 1094 | 2.8% |
| 43 | 1011 | 2.6% |
| 44 | 965 | 2.5% |
| 36 | 929 | 2.4% |
| Other values (89) | 24215 |
| Value | Count | Frequency (%) |
| 0 | 72 | 0.2% |
| 1 | 1200 | |
| 2 | 918 | |
| 3 | 705 | |
| 4 | 298 | 0.8% |
| 5 | 549 | |
| 6 | 665 | |
| 7 | 471 | 1.2% |
| 8 | 234 | 0.6% |
| 9 | 77 | 0.2% |
| Value | Count | Frequency (%) |
| 99 | 714 | 1.8% |
| 98 | 260 | 0.7% |
| 97 | 13 | < 0.1% |
| 96 | 502 | 1.3% |
| 95 | 1819 | |
| 94 | 248 | 0.6% |
| 93 | 235 | 0.6% |
| 92 | 264 | 0.7% |
| 91 | 212 | 0.5% |
| 90 | 1369 |
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.599842327 |
| Minimum | 0 |
|---|---|
| Maximum | 9 |
| Zeros | 1309 |
| Zeros (%) | 3.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 307.3 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 7 |
| 95-th percentile | 9 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.691454749 |
|---|---|
| Coefficient of variation (CV) | 0.5851189143 |
| Kurtosis | -1.265306425 |
| Mean | 4.599842327 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.002689232808 |
| Sum | 180875 |
| Variance | 7.243928665 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 6360 | |
| 1 | 5807 | |
| 4 | 5569 | |
| 2 | 4388 | |
| 8 | 4153 | |
| 5 | 3673 | |
| 3 | 2884 | |
| 9 | 2727 | |
| 6 | 2452 | 6.2% |
| 0 | 1309 | 3.3% |
| Value | Count | Frequency (%) |
| 0 | 1309 | 3.3% |
| 1 | 5807 | |
| 2 | 4388 | |
| 3 | 2884 | |
| 4 | 5569 | |
| 5 | 3673 | |
| 6 | 2452 | 6.2% |
| 7 | 6360 | |
| 8 | 4153 | |
| 9 | 2727 |
| Value | Count | Frequency (%) |
| 9 | 2727 | |
| 8 | 4153 | |
| 7 | 6360 | |
| 6 | 2452 | 6.2% |
| 5 | 3673 | |
| 4 | 5569 | |
| 3 | 2884 | |
| 2 | 4388 | |
| 1 | 5807 | |
| 0 | 1309 | 3.3% |
incorrect_fedas_1
Real number (ℝ)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.217410101 |
| Minimum | -1 |
|---|---|
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 10854 |
| Negative (%) | 27.6% |
| Memory size | 307.3 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | -1 |
| Q1 | -1 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 7 |
| Range | 8 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.488598113 |
|---|---|
| Coefficient of variation (CV) | 1.222758142 |
| Kurtosis | -0.8573205619 |
| Mean | 1.217410101 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.451797563 |
| Sum | 47871 |
| Variance | 2.215924342 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 17161 | |
| -1 | 10854 | |
| 3 | 6183 | 15.7% |
| 1 | 4977 | 12.7% |
| 6 | 93 | 0.2% |
| 5 | 28 | 0.1% |
| 7 | 25 | 0.1% |
| 4 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| -1 | 10854 | |
| 1 | 4977 | 12.7% |
| 2 | 17161 | |
| 3 | 6183 | 15.7% |
| 4 | 1 | < 0.1% |
| 5 | 28 | 0.1% |
| 6 | 93 | 0.2% |
| 7 | 25 | 0.1% |
| Value | Count | Frequency (%) |
| 7 | 25 | 0.1% |
| 6 | 93 | 0.2% |
| 5 | 28 | 0.1% |
| 4 | 1 | < 0.1% |
| 3 | 6183 | 15.7% |
| 2 | 17161 | |
| 1 | 4977 | 12.7% |
| -1 | 10854 |
incorrect_fedas_2
Real number (ℝ)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 52 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 33.65810488 |
| Minimum | -1 |
|---|---|
| Maximum | 98 |
| Zeros | 3649 |
| Zeros (%) | 9.3% |
| Negative | 10854 |
| Negative (%) | 27.6% |
| Memory size | 307.3 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | -1 |
| Q1 | -1 |
| median | 32 |
| Q3 | 75 |
| 95-th percentile | 78 |
| Maximum | 98 |
| Range | 99 |
| Interquartile range (IQR) | 76 |
Descriptive statistics
| Standard deviation | 33.5094127 |
|---|---|
| Coefficient of variation (CV) | 0.9955822772 |
| Kurtosis | -1.694404739 |
| Mean | 33.65810488 |
| Median Absolute Deviation (MAD) | 33 |
| Skewness | 0.2307684161 |
| Sum | 1323504 |
| Variance | 1122.88074 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -1 | 10854 | |
| 75 | 8201 | |
| 0 | 3649 | 9.3% |
| 78 | 3508 | 8.9% |
| 32 | 3112 | 7.9% |
| 64 | 2057 | 5.2% |
| 1 | 1268 | 3.2% |
| 46 | 1046 | 2.7% |
| 15 | 878 | 2.2% |
| 24 | 737 | 1.9% |
| Other values (42) | 4012 | 10.2% |
| Value | Count | Frequency (%) |
| -1 | 10854 | |
| 0 | 3649 | 9.3% |
| 1 | 1268 | 3.2% |
| 2 | 81 | 0.2% |
| 3 | 141 | 0.4% |
| 4 | 373 | 0.9% |
| 5 | 14 | < 0.1% |
| 8 | 5 | < 0.1% |
| 14 | 487 | 1.2% |
| 15 | 878 | 2.2% |
| Value | Count | Frequency (%) |
| 98 | 9 | < 0.1% |
| 90 | 3 | < 0.1% |
| 88 | 157 | 0.4% |
| 87 | 15 | < 0.1% |
| 86 | 13 | < 0.1% |
| 84 | 7 | < 0.1% |
| 82 | 32 | 0.1% |
| 81 | 6 | < 0.1% |
| 80 | 100 | 0.3% |
| 78 | 3508 |
| Distinct | 100 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.95239306 |
| Minimum | -1 |
|---|---|
| Maximum | 99 |
| Zeros | 67 |
| Zeros (%) | 0.2% |
| Negative | 10854 |
| Negative (%) | 27.6% |
| Memory size | 307.3 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | -1 |
| Q1 | -1 |
| median | 22 |
| Q3 | 51 |
| 95-th percentile | 96 |
| Maximum | 99 |
| Range | 100 |
| Interquartile range (IQR) | 52 |
Descriptive statistics
| Standard deviation | 33.31714526 |
|---|---|
| Coefficient of variation (CV) | 1.076399656 |
| Kurtosis | -0.6911563711 |
| Mean | 30.95239306 |
| Median Absolute Deviation (MAD) | 23 |
| Skewness | 0.7984227528 |
| Sum | 1217110 |
| Variance | 1110.032168 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -1 | 10854 | |
| 12 | 2936 | 7.5% |
| 31 | 1605 | 4.1% |
| 90 | 1198 | 3.0% |
| 37 | 1182 | 3.0% |
| 1 | 1174 | 3.0% |
| 27 | 1066 | 2.7% |
| 29 | 1012 | 2.6% |
| 96 | 984 | 2.5% |
| 99 | 798 | 2.0% |
| Other values (90) | 16513 |
| Value | Count | Frequency (%) |
| -1 | 10854 | |
| 0 | 67 | 0.2% |
| 1 | 1174 | 3.0% |
| 2 | 136 | 0.3% |
| 3 | 369 | 0.9% |
| 4 | 204 | 0.5% |
| 5 | 360 | 0.9% |
| 6 | 754 | 1.9% |
| 7 | 248 | 0.6% |
| 8 | 94 | 0.2% |
| Value | Count | Frequency (%) |
| 99 | 798 | |
| 98 | 672 | |
| 97 | 20 | 0.1% |
| 96 | 984 | |
| 95 | 235 | 0.6% |
| 94 | 291 | 0.7% |
| 93 | 193 | 0.5% |
| 92 | 179 | 0.5% |
| 91 | 260 | 0.7% |
| 90 | 1198 |
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.789812319 |
| Minimum | -1 |
|---|---|
| Maximum | 9 |
| Zeros | 987 |
| Zeros (%) | 2.5% |
| Negative | 10854 |
| Negative (%) | 27.6% |
| Memory size | 307.3 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | -1 |
| Q1 | -1 |
| median | 2 |
| Q3 | 6 |
| 95-th percentile | 9 |
| Maximum | 9 |
| Range | 10 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 3.304574955 |
|---|---|
| Coefficient of variation (CV) | 1.184515149 |
| Kurtosis | -1.155268062 |
| Mean | 2.789812319 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.3953028627 |
| Sum | 109701 |
| Variance | 10.92021563 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -1 | 10854 | |
| 1 | 5556 | |
| 7 | 3888 | 9.9% |
| 4 | 3659 | 9.3% |
| 2 | 3365 | 8.6% |
| 3 | 2841 | 7.2% |
| 8 | 2433 | 6.2% |
| 5 | 2172 | 5.5% |
| 9 | 2056 | 5.2% |
| 6 | 1511 | 3.8% |
| Value | Count | Frequency (%) |
| -1 | 10854 | |
| 0 | 987 | 2.5% |
| 1 | 5556 | |
| 2 | 3365 | 8.6% |
| 3 | 2841 | 7.2% |
| 4 | 3659 | 9.3% |
| 5 | 2172 | 5.5% |
| 6 | 1511 | 3.8% |
| 7 | 3888 | 9.9% |
| 8 | 2433 | 6.2% |
| Value | Count | Frequency (%) |
| 9 | 2056 | 5.2% |
| 8 | 2433 | |
| 7 | 3888 | |
| 6 | 1511 | 3.8% |
| 5 | 2172 | 5.5% |
| 4 | 3659 | |
| 3 | 2841 | |
| 2 | 3365 | |
| 1 | 5556 | |
| 0 | 987 | 2.5% |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| brand | model_code | model_label | commercial_label | incorrect_fedas_code | article_main_category | article_type | article_detail | comment | avalability_start_date | avalability_end_date | length | width | height | color_code | color_label | inaccurate_gender | country_of_origin | country_of_manufacture | embakment_harbor | shipping_date | eco_participation | eco_furniture | multiple_of_order | minimum_multiple_of_order | net_weight | raw_weight | volume | size | accurate_gender | correct_fedas_1 | correct_fedas_2 | correct_fedas_3 | correct_fedas_4 | incorrect_fedas_1 | incorrect_fedas_2 | incorrect_fedas_3 | incorrect_fedas_4 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | brand_293 | S42783 | FLEXAGON ENERGY TR 3.0 MT | NaN | 378011 | TRAINING | HOMME | 09-SHOES (LOW) | NaN | 2020-12-01 | 2021-05-31 | 0.0 | 0.0 | 0.0 | AC4H | AC4H TRUGR7/TRUGR7/FTWWHT | NaN | NaN | NaN | NaN | NaN | 0.0 | 0.0 | 1 | 0 | 0.0 | 0.0 | 0.0 | 38.5 | HO | 3 | 78 | 10 | 1 | 3 | 78 | 1 | 1 |
| 1 | brand_3 | R1252 | TADEN PLUS FUR | NaN | GARDEN | RUBBER BOOTS | BOOTS | NaN | 2020-01-01 | 2020-12-31 | 0.0 | 0.0 | 0.0 | NaN | NOIR | FE | CN | CN | NaN | 20200715.0 | 0.0 | 0.0 | 1 | 1 | 0.0 | 0.0 | 0.0 | 36 | FE | 3 | 64 | 30 | 8 | -1 | -1 | -1 | -1 | |
| 2 | brand_265 | OXS917808 | POCHETTE PORTE TRAVERS PE | NaN | 175897 | SAC | HOMME | N1FARROW | MATERIEL RANDONNEE | 2021-01-01 | 2021-06-30 | 0.0 | 0.0 | 0.0 | AE2W | DEEP MARINE | NaN | NaN | NaN | NaN | NaN | 0.0 | 0.0 | 0 | 0 | 0.0 | 0.0 | 0.0 | U | UA | 1 | 75 | 89 | 0 | 1 | 75 | 89 | 7 |
| 3 | brand_1 | GM5253 | CLUB KNOT TANK | NaN | 224122 | RACKET SPORTS | FEMME | 21-TANK | NaN | 2020-12-01 | 2021-05-31 | 0.0 | 0.0 | 0.0 | 001A | 001A WHITE/BLACK | NaN | NaN | NaN | NaN | NaN | 0.0 | 0.0 | 1 | 0 | 0.0 | 0.0 | 0.0 | 2XS | FE | 2 | 24 | 11 | 8 | 2 | 24 | 12 | 2 |
| 4 | brand_12 | MS338 | BONITA DK PNK/BLCK M | NaN | NaN | NaN | NaN | SNO | 2020-01-01 | 2020-12-31 | 0.0 | 0.0 | 0.0 | NaN | DARK PINK-BLACK | NaN | NaN | NaN | NaN | NaN | 0.0 | 0.0 | 6 | 0 | 0.0 | 0.0 | 0.0 | M | UA | 1 | 15 | 94 | 4 | -1 | -1 | -1 | -1 | |
| 5 | brand_113 | 687628 | TALASI AOP DROPPED SHOULDER TEE | NaN | TARTAN CHECKS | MEN | T-SHIRT | NaN | NaT | NaT | 0.0 | 0.0 | 0.0 | A433 | A433 BLACK TARTAN ALLOVER | NaN | BD | BD | BANGLADESH | NaN | 0.0 | 0.0 | 0 | 0 | 0.0 | 0.0 | 0.0 | 2XL | HO | 2 | 0 | 12 | 4 | -1 | -1 | -1 | -1 | |
| 6 | brand_139 | PW7381P0302 | FLAT TOP ZIP HOLDALL | NaN | 175834 | PERIPHERAL | PERIPHERAL MISC | HOLDALL | NaN | NaT | NaT | 0.0 | 0.0 | 0.0 | BLA | BLACK | FE | CN | CN | NaN | 20191206.0 | 0.0 | 0.0 | 1 | 1 | 0.0 | 0.0 | 0.0 | TU | UN | 1 | 75 | 85 | 0 | 1 | 75 | 83 | 4 |
| 7 | brand_1 | FU9652 | STAN SMITH W | NaN | 375962 | SPORTSTYLE | FEMME | 09-SHOES (LOW) | NaN | 2020-11-01 | 2020-11-30 | 0.0 | 0.0 | 0.0 | A0QM | A0QM CBLACK/OWHITE/FTWWHT | NaN | NaN | NaN | NaN | NaN | 0.0 | 0.0 | 1 | 0 | 0.0 | 0.0 | 0.0 | 35.5 | FE | 3 | 75 | 2 | 2 | 3 | 75 | 96 | 2 |
| 8 | brand_303 | ARGL100287 | RG SLIPPY II G | NaN | 315903 | LOISIRS | FILLE | SANDALE | NaN | 2021-02-01 | NaT | 0.0 | 0.0 | 0.0 | BL0 | BL0 BL0-BLACK | NaN | CN | CN | CHINE | 20210201.0 | 0.0 | 0.0 | 0 | 0 | 0.0 | 0.0 | 0.0 | 20 | FI | 3 | 15 | 93 | 3 | 3 | 15 | 90 | 3 |
| 9 | brand_241 | 11421859 | TEAM LOGO PO HOODY SADPAD BLEU | NaN | 275297 | TEXTILE HOMME | SWEAT | SWEAT CAPUCHE HOMME | NaN | NaT | NaT | 0.0 | 0.0 | 0.0 | 410 | BLEU | HO | NL | TR | NaN | 20200224.0 | 0.0 | 0.0 | 1 | 6 | 0.5 | 0.5 | 0.0 | 3XL | UN | 1 | 37 | 79 | 1 | 2 | 75 | 29 | 7 |
Last rows
| brand | model_code | model_label | commercial_label | incorrect_fedas_code | article_main_category | article_type | article_detail | comment | avalability_start_date | avalability_end_date | length | width | height | color_code | color_label | inaccurate_gender | country_of_origin | country_of_manufacture | embakment_harbor | shipping_date | eco_participation | eco_furniture | multiple_of_order | minimum_multiple_of_order | net_weight | raw_weight | volume | size | accurate_gender | correct_fedas_1 | correct_fedas_2 | correct_fedas_3 | correct_fedas_4 | incorrect_fedas_1 | incorrect_fedas_2 | incorrect_fedas_3 | incorrect_fedas_4 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 39312 | brand_77 | M9621C | CHUCK TAYLOR ALL STAR | NaN | 375954 | CHAUSSURE | BASKET | SPORTSWEAR | NaN | 2021-01-01 | NaT | 0.0 | 0.0 | 0.0 | 600 | 600 RED | HO | BE | ID | NaN | 20200503.0 | 0.0 | 0.0 | 6 | 12 | 0.00 | 0.00 | 0.00 | 35 | HO | 3 | 75 | 85 | 1 | 3 | 75 | 95 | 4 |
| 39313 | brand_182 | KI0306 | SAC ISOTHERME | SAC ISOTHERME | COLLECTIVITES | SAC | NaN | NaN | NaT | NaT | 0.0 | 0.0 | 0.0 | 37738 | 37738 37738 LIGHT GREY | UN | CN | CN | NaN | 20200415.0 | 0.0 | 0.0 | 1 | 1 | 0.30 | 0.33 | 3.14 | TU | UN | 1 | 75 | 83 | 0 | -1 | -1 | -1 | -1 | |
| 39314 | brand_272 | PHMO99006CHR4 | MOUSTIQUAIRE BANGLA | NaN | 100311 | COSMETIQUE | MOUSTIQUAIRE 2 PLACES | FORME CUBIQUE | NaN | NaT | NaT | 0.0 | 0.0 | 0.0 | NaN | NS | UN | FR | FR | NaN | 20200102.0 | 0.0 | 0.0 | 1 | 1 | 0.00 | 0.00 | 0.00 | TU | UN | 1 | 67 | 99 | 0 | 1 | 0 | 31 | 1 |
| 39315 | brand_383 | DW0DW09475 | SYLVIA HR SPR SKNY ANKLE LNMBS | NaN | 275478 | APPAREL | DENIM PANTS | DENIM PANTS | NaN | NaT | NaT | 0.0 | 0.0 | 0.0 | 1A5 | 1A5 LANE MB STR | FE | TN | TN | NaN | 20200630.0 | 0.0 | 0.0 | 1 | 1 | 0.70 | 0.00 | 0.00 | 2824 | FE | 2 | 75 | 44 | 2 | 2 | 75 | 47 | 8 |
| 39316 | brand_389 | 202-5 | CARTON D ARBITRE LOT DE 5 | NaN | 132994 | FOOT | UNISEXE ADULTE | EQUIPEMENT ARBITRE | NaN | NaT | NaT | 0.0 | 0.0 | 0.0 | NaN | BLANC | NaN | CN | CN | GLEIZE | 20181231.0 | 0.0 | 0.0 | 0 | 0 | 0.00 | 0.00 | 0.00 | TU | UN | 1 | 31 | 18 | 0 | 1 | 32 | 99 | 4 |
| 39317 | brand_152 | 57502669I | ICEPEAK BAUTZEN | ICEPEAK BAUTZEN | OUTDOOR ADVENTURE | SHORT | NaN | NaN | 2020-06-01 | 2020-09-13 | 0.0 | 0.0 | 0.0 | 290 | 290 ANTHRACITE | HO | FI | CN | NaN | 20200608.0 | 0.0 | 0.0 | 1 | 1 | 0.13 | 0.15 | 0.00 | 46 | HO | 2 | 64 | 70 | 1 | -1 | -1 | -1 | -1 | |
| 39318 | brand_329 | CSECURUN12 | SEMELLES RUN CUSTOM | NaN | 100981 | RUNNING | UNISEX | SEMELLE | NaN | 2019-02-01 | 2019-12-31 | 0.0 | 0.0 | 0.0 | NaN | NS | NaN | MA | MA | NaN | 20190901.0 | 0.0 | 0.0 | 0 | 0 | 0.00 | 0.00 | 0.00 | L | UA | 1 | 46 | 98 | 1 | 1 | 0 | 98 | 1 |
| 39319 | brand_17 | 2032B756 | KATAKANA GRAPHIC TEE | NaN | 278135 | TRAINING | FEMME | KATAKANA GRAPHIC T | NaN | 2021-01-15 | 2037-12-31 | 0.0 | 0.0 | 0.0 | 002 | 002 PERFORMANCE BLACK/BRILLIANT WH | NaN | NaN | NaN | NaN | NaN | 0.0 | 0.0 | 1 | 0 | 0.00 | 0.00 | 0.00 | L | FE | 2 | 0 | 12 | 5 | 2 | 78 | 13 | 5 |
| 39320 | brand_1 | FM9969 | ESSENTIAL TEE | NaN | 275124 | SPORTSTYLE | HOMME | 27-T-SHIRT (SHORT SLEEVE) | NaN | 2020-05-01 | 2020-11-30 | 0.0 | 0.0 | 0.0 | 095A | 095A BLACK | NaN | NaN | NaN | NaN | NaN | 0.0 | 0.0 | 1 | 0 | 0.00 | 0.00 | 0.00 | L | HO | 2 | 0 | 12 | 4 | 2 | 75 | 12 | 4 |
| 39321 | brand_150 | 208997 | DEUCE COURT CANVAS | NaN | 375961 | LOISIR | CHAUSSURE | BASSE | NaN | 2020-07-01 | 2020-12-31 | 0.0 | 0.0 | 0.0 | 2001 | BLACK | HO | CN | CN | NaN | 20191126.0 | 0.0 | 0.0 | 1 | 1 | 0.90 | 0.00 | 0.00 | 36 | HO | 3 | 75 | 95 | 7 | 3 | 75 | 96 | 1 |